Document (#37904)

Author
Kempf, A.O.
Title
Automatische Inhaltserschließung in der Fachinformation
Source
Information - Wissenschaft und Praxis. 64(2013) H.2/3, S.96-106
Year
2013
Abstract
Der Artikel basiert auf einer Masterarbeit mit dem Titel "Automatische Indexierung in der sozialwissenschaftlichen Fachinformation. Eine Evaluationsstudie zur maschinellen Erschließung für die Datenbank SOLIS" (Kempf 2012), die im Rahmen des Aufbaustudiengangs Bibliotheks- und Informationswissenschaft an der Humboldt- Universität zu Berlin am Lehrstuhl Information Retrieval verfasst wurde. Auf der Grundlage des Schalenmodells zur Inhaltserschließung in der Fachinformation stellt der Artikel Evaluationsergebnisse eines automatischen Erschließungsverfahrens für den Einsatz in der sozialwissenschaftlichen Fachinformation vor. Ausgehend von dem von Krause beschriebenen Anwendungsszenario, wonach SOLIS-Datenbestände (Sozialwissenschaftliches Literaturinformationssystem) von geringerer Relevanz automatisch erschlossen werden sollten, wurden auf dieser Dokumentgrundlage zwei Testreihen mit der Indexierungssoftware MindServer der Firma Recommind durchgeführt. Neben den Auswirkungen allgemeiner Systemeinstellungen in der ersten Testreihe wurde in der zweiten Testreihe die Indexierungsleistung der Software für die Rand- und die Kernbereiche der Literaturdatenbank miteinander verglichen. Für letztere Testreihe wurden für beide Bereiche der Datenbank spezifische Versionen der Indexierungssoftware aufgebaut, die anhand von Dokumentkorpora aus den entsprechenden Bereichen trainiert wurden. Die Ergebnisse der Evaluation, die auf der Grundlage intellektuell generierter Vergleichsdaten erfolgt, weisen auf Unterschiede in der Indexierungsleistung zwischen Rand- und Kernbereichen hin, die einerseits gegen den Einsatz automatischer Indexierungsverfahren in den Randbereichen sprechen. Andererseits deutet sich an, dass sich die Indexierungsresultate durch den Aufbau fachteilgebietsspezifischer Trainingsmengen verbessern lassen.
Content
Vgl.: http://www.degruyter.com/view/j/iwp.2013.64.issue-2-3/iwp-2013-0011/iwp-2013-0011.xml?format=INT.
Theme
Automatisches Indexieren
Field
Sozialwissenschaften
Object
SOLIS

Similar documents (author)

  1. Kempf, G.: Klassifikationsprobleme der Rechtswissenschaft (1972) 5.24
    5.241229 = sum of:
      5.241229 = weight(author_txt:kempf in 4740) [ClassicSimilarity], result of:
        5.241229 = fieldWeight in 4740, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.385966 = idf(docFreq=26, maxDocs=43556)
          0.625 = fieldNorm(doc=4740)
    
  2. Kempf, A.: Thematischer Zugang zu Fachinformationen im Internet (1994) 5.24
    5.241229 = sum of:
      5.241229 = weight(author_txt:kempf in 972) [ClassicSimilarity], result of:
        5.241229 = fieldWeight in 972, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.385966 = idf(docFreq=26, maxDocs=43556)
          0.625 = fieldNorm(doc=972)
    
  3. Kempf, A.: Forstliche Klassifikation und Meta-Information zum Wald im Internet (1995) 5.24
    5.241229 = sum of:
      5.241229 = weight(author_txt:kempf in 3270) [ClassicSimilarity], result of:
        5.241229 = fieldWeight in 3270, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.385966 = idf(docFreq=26, maxDocs=43556)
          0.625 = fieldNorm(doc=3270)
    
  4. Kempf, A.: Advocating global forest issues on the Internet (1996) 5.24
    5.241229 = sum of:
      5.241229 = weight(author_txt:kempf in 91) [ClassicSimilarity], result of:
        5.241229 = fieldWeight in 91, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.385966 = idf(docFreq=26, maxDocs=43556)
          0.625 = fieldNorm(doc=91)
    
  5. Kempf, K.: Dalla Germania un esempio avanzato di sistema integrato (1997) 5.24
    5.241229 = sum of:
      5.241229 = weight(author_txt:kempf in 844) [ClassicSimilarity], result of:
        5.241229 = fieldWeight in 844, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.385966 = idf(docFreq=26, maxDocs=43556)
          0.625 = fieldNorm(doc=844)
    

Similar documents (content)

  1. Kempf, A.O.: Automatische Indexierung in der sozialwissenschaftlichen Fachinformation : eine Evaluationsstudie zur maschinellen Erschließung für die Datenbank SOLIS (2012) 0.49
    0.4936541 = sum of:
      0.4936541 = product of:
        1.7630503 = sum of:
          0.11025981 = weight(abstract_txt:literaturdatenbank in 2901) [ClassicSimilarity], result of:
            0.11025981 = score(doc=2901,freq=1.0), product of:
              0.1572726 = queryWeight, product of:
                1.0837073 = boost
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.01617212 = queryNorm
              0.7010745 = fieldWeight in 2901, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.078125 = fieldNorm(doc=2901)
          0.11561917 = weight(abstract_txt:indexierungsverfahren in 2901) [ClassicSimilarity], result of:
            0.11561917 = score(doc=2901,freq=1.0), product of:
              0.16232853 = queryWeight, product of:
                1.1009887 = boost
                9.116854 = idf(docFreq=12, maxDocs=43556)
                0.01617212 = queryNorm
              0.71225417 = fieldWeight in 2901, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.116854 = idf(docFreq=12, maxDocs=43556)
                0.078125 = fieldNorm(doc=2901)
          0.070927404 = weight(abstract_txt:datenbank in 2901) [ClassicSimilarity], result of:
            0.070927404 = score(doc=2901,freq=1.0), product of:
              0.14765936 = queryWeight, product of:
                1.4850154 = boost
                6.148413 = idf(docFreq=252, maxDocs=43556)
                0.01617212 = queryNorm
              0.48034477 = fieldWeight in 2901, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.148413 = idf(docFreq=252, maxDocs=43556)
                0.078125 = fieldNorm(doc=2901)
          0.14351706 = weight(abstract_txt:automatische in 2901) [ClassicSimilarity], result of:
            0.14351706 = score(doc=2901,freq=2.0), product of:
              0.18748966 = queryWeight, product of:
                1.673359 = boost
                6.9282126 = idf(docFreq=115, maxDocs=43556)
                0.01617212 = queryNorm
              0.7654666 = fieldWeight in 2901, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9282126 = idf(docFreq=115, maxDocs=43556)
                0.078125 = fieldNorm(doc=2901)
          0.21142043 = weight(abstract_txt:sozialwissenschaftlichen in 2901) [ClassicSimilarity], result of:
            0.21142043 = score(doc=2901,freq=1.0), product of:
              0.305832 = queryWeight, product of:
                2.1371841 = boost
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.01617212 = queryNorm
              0.691296 = fieldWeight in 2901, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.078125 = fieldNorm(doc=2901)
          0.3270204 = weight(abstract_txt:solis in 2901) [ClassicSimilarity], result of:
            0.3270204 = score(doc=2901,freq=2.0), product of:
              0.32465705 = queryWeight, product of:
                2.2019775 = boost
                9.116854 = idf(docFreq=12, maxDocs=43556)
                0.01617212 = queryNorm
              1.0072795 = fieldWeight in 2901, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.116854 = idf(docFreq=12, maxDocs=43556)
                0.078125 = fieldNorm(doc=2901)
          0.784286 = weight(title_txt:fachinformation in 2901) [ClassicSimilarity], result of:
            0.784286 = score(doc=2901,freq=1.0), product of:
              0.42521763 = queryWeight, product of:
                3.5638638 = boost
                7.3777375 = idf(docFreq=73, maxDocs=43556)
                0.01617212 = queryNorm
              1.8444344 = fieldWeight in 2901, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3777375 = idf(docFreq=73, maxDocs=43556)
                0.25 = fieldNorm(doc=2901)
        0.28 = coord(7/25)
    
  2. Seeger, T.: Entwicklung der Fachinformation und -kommunikation (2004) 0.18
    0.17621237 = sum of:
      0.17621237 = product of:
        1.4684365 = sum of:
          0.06930605 = weight(abstract_txt:beschriebenen in 3905) [ClassicSimilarity], result of:
            0.06930605 = score(doc=3905,freq=1.0), product of:
              0.13391495 = queryWeight, product of:
                8.280605 = idf(docFreq=29, maxDocs=43556)
                0.01617212 = queryNorm
              0.51753783 = fieldWeight in 3905, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.280605 = idf(docFreq=29, maxDocs=43556)
                0.0625 = fieldNorm(doc=3905)
          0.026629837 = weight(abstract_txt:wurde in 3905) [ClassicSimilarity], result of:
            0.026629837 = score(doc=3905,freq=1.0), product of:
              0.0891738 = queryWeight, product of:
                1.1540353 = boost
                4.7780557 = idf(docFreq=995, maxDocs=43556)
                0.01617212 = queryNorm
              0.29862848 = fieldWeight in 3905, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7780557 = idf(docFreq=995, maxDocs=43556)
                0.0625 = fieldNorm(doc=3905)
          1.3725005 = weight(title_txt:fachinformation in 3905) [ClassicSimilarity], result of:
            1.3725005 = score(doc=3905,freq=1.0), product of:
              0.42521763 = queryWeight, product of:
                3.5638638 = boost
                7.3777375 = idf(docFreq=73, maxDocs=43556)
                0.01617212 = queryNorm
              3.22776 = fieldWeight in 3905, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3777375 = idf(docFreq=73, maxDocs=43556)
                0.4375 = fieldNorm(doc=3905)
        0.12 = coord(3/25)
    
  3. Capurro, R.: Hermeneutik der Fachinformation (1986) 0.13
    0.12655096 = sum of:
      0.12655096 = product of:
        1.581887 = sum of:
          0.013314919 = weight(abstract_txt:wurde in 611) [ClassicSimilarity], result of:
            0.013314919 = score(doc=611,freq=1.0), product of:
              0.0891738 = queryWeight, product of:
                1.1540353 = boost
                4.7780557 = idf(docFreq=995, maxDocs=43556)
                0.01617212 = queryNorm
              0.14931424 = fieldWeight in 611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7780557 = idf(docFreq=995, maxDocs=43556)
                0.03125 = fieldNorm(doc=611)
          1.568572 = weight(title_txt:fachinformation in 611) [ClassicSimilarity], result of:
            1.568572 = score(doc=611,freq=1.0), product of:
              0.42521763 = queryWeight, product of:
                3.5638638 = boost
                7.3777375 = idf(docFreq=73, maxDocs=43556)
                0.01617212 = queryNorm
              3.6888688 = fieldWeight in 611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3777375 = idf(docFreq=73, maxDocs=43556)
                0.5 = fieldNorm(doc=611)
        0.08 = coord(2/25)
    
  4. Herb, U.: Wege zur psychologischen Fachinformation : Eine Bilanz aus der Virtuellen Fachbibliothek Psychologie (2002) 0.11
    0.10603364 = sum of:
      0.10603364 = product of:
        0.88361365 = sum of:
          0.026629837 = weight(abstract_txt:wurde in 2175) [ClassicSimilarity], result of:
            0.026629837 = score(doc=2175,freq=1.0), product of:
              0.0891738 = queryWeight, product of:
                1.1540353 = boost
                4.7780557 = idf(docFreq=995, maxDocs=43556)
                0.01617212 = queryNorm
              0.29862848 = fieldWeight in 2175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7780557 = idf(docFreq=995, maxDocs=43556)
                0.0625 = fieldNorm(doc=2175)
          0.072697796 = weight(abstract_txt:wurden in 2175) [ClassicSimilarity], result of:
            0.072697796 = score(doc=2175,freq=2.0), product of:
              0.15825576 = queryWeight, product of:
                1.882894 = boost
                5.1971674 = idf(docFreq=654, maxDocs=43556)
                0.01617212 = queryNorm
              0.45936903 = fieldWeight in 2175, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1971674 = idf(docFreq=654, maxDocs=43556)
                0.0625 = fieldNorm(doc=2175)
          0.784286 = weight(title_txt:fachinformation in 2175) [ClassicSimilarity], result of:
            0.784286 = score(doc=2175,freq=1.0), product of:
              0.42521763 = queryWeight, product of:
                3.5638638 = boost
                7.3777375 = idf(docFreq=73, maxDocs=43556)
                0.01617212 = queryNorm
              1.8444344 = fieldWeight in 2175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3777375 = idf(docFreq=73, maxDocs=43556)
                0.25 = fieldNorm(doc=2175)
        0.12 = coord(3/25)
    
  5. Habermann, K.: vifamath - mathematische Fachinformation aus einer Hand (2010) 0.10
    0.10233913 = sum of:
      0.10233913 = product of:
        1.2792392 = sum of:
          0.102810204 = weight(abstract_txt:wurden in 2035) [ClassicSimilarity], result of:
            0.102810204 = score(doc=2035,freq=1.0), product of:
              0.15825576 = queryWeight, product of:
                1.882894 = boost
                5.1971674 = idf(docFreq=654, maxDocs=43556)
                0.01617212 = queryNorm
              0.6496459 = fieldWeight in 2035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1971674 = idf(docFreq=654, maxDocs=43556)
                0.125 = fieldNorm(doc=2035)
          1.176429 = weight(title_txt:fachinformation in 2035) [ClassicSimilarity], result of:
            1.176429 = score(doc=2035,freq=1.0), product of:
              0.42521763 = queryWeight, product of:
                3.5638638 = boost
                7.3777375 = idf(docFreq=73, maxDocs=43556)
                0.01617212 = queryNorm
              2.7666516 = fieldWeight in 2035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3777375 = idf(docFreq=73, maxDocs=43556)
                0.375 = fieldNorm(doc=2035)
        0.08 = coord(2/25)