Document (#26509)

Author
Peters, G.
Gaese, V.
Title
¬Das DocCat-System in der Textdokumentation von G+J
Source
Medien-Informationsmanagement: Archivarische, dokumentarische, betriebswirtschaftliche, rechtliche und Berufsbild-Aspekte. Hrsg.: Marianne Englert u.a
Imprint
Münster : LIT Verlag
Year
2003
Pages
S.123-133
Series
Beiträge zur Mediendokumentation; Bd.6
Abstract
Wir werden einmal die Grundlagen des Text-Mining-Systems bei IBM darstellen, dann werden wir das Projekt etwas umfangreicher und deutlicher darstellen, da kennen wir uns aus. Von daher haben wir zwei Teile, einmal Heidelberg, einmal Hamburg. Noch einmal zur Technologie. Text-Mining ist eine von IBM entwickelte Technologie, die in einer besonderen Ausformung und Programmierung für uns zusammengestellt wurde. Das Projekt hieß bei uns lange Zeit DocText Miner und heißt seit einiger Zeit auf Vorschlag von IBM DocCat, das soll eine Abkürzung für Document-Categoriser sein, sie ist ja auch nett und anschaulich. Wir fangen an mit Text-Mining, das bei IBM in Heidelberg entwickelt wurde. Die verstehen darunter das automatische Indexieren als eine Instanz, also einen Teil von Text-Mining. Probleme werden dabei gezeigt, und das Text-Mining ist eben eine Methode zur Strukturierung von und der Suche in großen Dokumentenmengen, die Extraktion von Informationen und, das ist der hohe Anspruch, von impliziten Zusammenhängen. Das letztere sei dahingestellt. IBM macht das quantitativ, empirisch, approximativ und schnell. das muss man wirklich sagen. Das Ziel, und das ist ganz wichtig für unser Projekt gewesen, ist nicht, den Text zu verstehen, sondern das Ergebnis dieser Verfahren ist, was sie auf Neudeutsch a bundle of words, a bag of words nennen, also eine Menge von bedeutungstragenden Begriffen aus einem Text zu extrahieren, aufgrund von Algorithmen, also im Wesentlichen aufgrund von Rechenoperationen. Es gibt eine ganze Menge von linguistischen Vorstudien, ein wenig Linguistik ist auch dabei, aber nicht die Grundlage der ganzen Geschichte. Was sie für uns gemacht haben, ist also die Annotierung von Pressetexten für unsere Pressedatenbank. Für diejenigen, die es noch nicht kennen: Gruner + Jahr führt eine Textdokumentation, die eine Datenbank führt, seit Anfang der 70er Jahre, da sind z.Z. etwa 6,5 Millionen Dokumente darin, davon etwas über 1 Million Volltexte ab 1993. Das Prinzip war lange Zeit, dass wir die Dokumente, die in der Datenbank gespeichert waren und sind, verschlagworten und dieses Prinzip haben wir auch dann, als der Volltext eingeführt wurde, in abgespeckter Form weitergeführt. Zu diesen 6,5 Millionen Dokumenten gehören dann eben auch ungefähr 10 Millionen Faksimileseiten, weil wir die Faksimiles auch noch standardmäßig aufheben.
Theme
Data Mining
Dokumentenmanagement
Object
DocCat

Similar documents (author)

  1. Peters, C.M.: CD-ROM: its potential in libraries (1986) 4.78
    4.7847233 = sum of:
      4.7847233 = weight(author_txt:peters in 535) [ClassicSimilarity], result of:
        4.7847233 = fieldWeight in 535, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.655557 = idf(docFreq=54, maxDocs=42740)
          0.625 = fieldNorm(doc=535)
    
  2. Peters, T.A.: When smart people fail : an analysis of the transaction log of an online public access catalog (1989) 4.78
    4.7847233 = sum of:
      4.7847233 = weight(author_txt:peters in 2283) [ClassicSimilarity], result of:
        4.7847233 = fieldWeight in 2283, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.655557 = idf(docFreq=54, maxDocs=42740)
          0.625 = fieldNorm(doc=2283)
    
  3. Peters, C.M.: CD-ROM and optical technology : the user interface (1988) 4.78
    4.7847233 = sum of:
      4.7847233 = weight(author_txt:peters in 4013) [ClassicSimilarity], result of:
        4.7847233 = fieldWeight in 4013, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.655557 = idf(docFreq=54, maxDocs=42740)
          0.625 = fieldNorm(doc=4013)
    
  4. Peters, B.F.: Online searching using speech as a man / machine interface (1989) 4.78
    4.7847233 = sum of:
      4.7847233 = weight(author_txt:peters in 4637) [ClassicSimilarity], result of:
        4.7847233 = fieldWeight in 4637, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.655557 = idf(docFreq=54, maxDocs=42740)
          0.625 = fieldNorm(doc=4637)
    
  5. Peters, R.: Katalogisierung mit MIDAS (1991) 4.78
    4.7847233 = sum of:
      4.7847233 = weight(author_txt:peters in 4740) [ClassicSimilarity], result of:
        4.7847233 = fieldWeight in 4740, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.655557 = idf(docFreq=54, maxDocs=42740)
          0.625 = fieldNorm(doc=4740)
    

Similar documents (content)

  1. Jörn, F.: Wie Google für uns nach der ominösen Gluonenkraft stöbert : Software-Krabbler machen sich vor der Anfrage auf die Suche - Das Netz ist etwa fünfhundertmal größer als alles Durchforschte (2001) 0.26
    0.26457676 = sum of:
      0.26457676 = product of:
        0.4724585 = sum of:
          0.018527236 = weight(abstract_txt:verstehen in 685) [ClassicSimilarity], result of:
            0.018527236 = score(doc=685,freq=1.0), product of:
              0.14785135 = queryWeight, product of:
                6.4158664 = idf(docFreq=189, maxDocs=42740)
                0.023044642 = queryNorm
              0.12530988 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4158664 = idf(docFreq=189, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.019351175 = weight(abstract_txt:etwas in 685) [ClassicSimilarity], result of:
            0.019351175 = score(doc=685,freq=1.0), product of:
              0.15220296 = queryWeight, product of:
                1.0146095 = boost
                6.5095987 = idf(docFreq=172, maxDocs=42740)
                0.023044642 = queryNorm
              0.1271406 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5095987 = idf(docFreq=172, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.021719905 = weight(abstract_txt:lange in 685) [ClassicSimilarity], result of:
            0.021719905 = score(doc=685,freq=1.0), product of:
              0.16438295 = queryWeight, product of:
                1.0544251 = boost
                6.765051 = idf(docFreq=133, maxDocs=42740)
                0.023044642 = queryNorm
              0.13212991 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.765051 = idf(docFreq=133, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.0056693363 = weight(abstract_txt:also in 685) [ClassicSimilarity], result of:
            0.0056693363 = score(doc=685,freq=1.0), product of:
              0.08458948 = queryWeight, product of:
                1.0696964 = boost
                3.4315145 = idf(docFreq=3756, maxDocs=42740)
                0.023044642 = queryNorm
              0.067021765 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4315145 = idf(docFreq=3756, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.021650314 = weight(abstract_txt:haben in 685) [ClassicSimilarity], result of:
            0.021650314 = score(doc=685,freq=4.0), product of:
              0.11828729 = queryWeight, product of:
                1.0954739 = boost
                4.6856093 = idf(docFreq=1071, maxDocs=42740)
                0.023044642 = queryNorm
              0.18303162 = fieldWeight in 685, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6856093 = idf(docFreq=1071, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.032123096 = weight(abstract_txt:noch in 685) [ClassicSimilarity], result of:
            0.032123096 = score(doc=685,freq=8.0), product of:
              0.12213221 = queryWeight, product of:
                1.1131357 = boost
                4.761153 = idf(docFreq=993, maxDocs=42740)
                0.023044642 = queryNorm
              0.26301903 = fieldWeight in 685, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.761153 = idf(docFreq=993, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.011593014 = weight(abstract_txt:wurde in 685) [ClassicSimilarity], result of:
            0.011593014 = score(doc=685,freq=1.0), product of:
              0.12381678 = queryWeight, product of:
                1.1207861 = boost
                4.793876 = idf(docFreq=961, maxDocs=42740)
                0.023044642 = queryNorm
              0.093630396 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.793876 = idf(docFreq=961, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.030178383 = weight(abstract_txt:zeit in 685) [ClassicSimilarity], result of:
            0.030178383 = score(doc=685,freq=3.0), product of:
              0.16245715 = queryWeight, product of:
                1.2838149 = boost
                5.49119 = idf(docFreq=478, maxDocs=42740)
                0.023044642 = queryNorm
              0.1857621 = fieldWeight in 685, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.49119 = idf(docFreq=478, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.052506104 = weight(abstract_txt:dann in 685) [ClassicSimilarity], result of:
            0.052506104 = score(doc=685,freq=8.0), product of:
              0.16946961 = queryWeight, product of:
                1.3112301 = boost
                5.6084514 = idf(docFreq=425, maxDocs=42740)
                0.023044642 = queryNorm
              0.30982608 = fieldWeight in 685, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.6084514 = idf(docFreq=425, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.024798201 = weight(abstract_txt:auch in 685) [ClassicSimilarity], result of:
            0.024798201 = score(doc=685,freq=7.0), product of:
              0.1274028 = queryWeight, product of:
                1.4677323 = boost
                3.7667098 = idf(docFreq=2686, maxDocs=42740)
                0.023044642 = queryNorm
              0.1946441 = fieldWeight in 685, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.7667098 = idf(docFreq=2686, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.09278384 = weight(abstract_txt:millionen in 685) [ClassicSimilarity], result of:
            0.09278384 = score(doc=685,freq=10.0), product of:
              0.22994828 = queryWeight, product of:
                1.5273834 = boost
                6.532992 = idf(docFreq=168, maxDocs=42740)
                0.023044642 = queryNorm
              0.4034987 = fieldWeight in 685, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                6.532992 = idf(docFreq=168, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.0795482 = weight(abstract_txt:einmal in 685) [ClassicSimilarity], result of:
            0.0795482 = score(doc=685,freq=4.0), product of:
              0.31000006 = queryWeight, product of:
                2.0477798 = boost
                6.5691404 = idf(docFreq=162, maxDocs=42740)
                0.023044642 = queryNorm
              0.25660706 = fieldWeight in 685, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.5691404 = idf(docFreq=162, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.03894158 = weight(abstract_txt:eine in 685) [ClassicSimilarity], result of:
            0.03894158 = score(doc=685,freq=10.0), product of:
              0.1787505 = queryWeight, product of:
                2.1990798 = boost
                3.5272505 = idf(docFreq=3413, maxDocs=42740)
                0.023044642 = queryNorm
              0.2178544 = fieldWeight in 685, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                3.5272505 = idf(docFreq=3413, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.023068106 = weight(abstract_txt:text in 685) [ClassicSimilarity], result of:
            0.023068106 = score(doc=685,freq=2.0), product of:
              0.20620799 = queryWeight, product of:
                2.2093987 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.023044642 = queryNorm
              0.11186814 = fieldWeight in 685, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
        0.56 = coord(14/25)
    
  2. Arns, C.: Fallstricke Online : Über die eigenen Worte gestolpert (2005) 0.20
    0.20323746 = sum of:
      0.20323746 = product of:
        0.63511705 = sum of:
          0.061923765 = weight(abstract_txt:etwas in 4503) [ClassicSimilarity], result of:
            0.061923765 = score(doc=4503,freq=1.0), product of:
              0.15220296 = queryWeight, product of:
                1.0146095 = boost
                6.5095987 = idf(docFreq=172, maxDocs=42740)
                0.023044642 = queryNorm
              0.40684992 = fieldWeight in 4503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5095987 = idf(docFreq=172, maxDocs=42740)
                0.0625 = fieldNorm(doc=4503)
          0.034640502 = weight(abstract_txt:haben in 4503) [ClassicSimilarity], result of:
            0.034640502 = score(doc=4503,freq=1.0), product of:
              0.11828729 = queryWeight, product of:
                1.0954739 = boost
                4.6856093 = idf(docFreq=1071, maxDocs=42740)
                0.023044642 = queryNorm
              0.29285058 = fieldWeight in 4503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6856093 = idf(docFreq=1071, maxDocs=42740)
                0.0625 = fieldNorm(doc=4503)
          0.07925779 = weight(abstract_txt:menge in 4503) [ClassicSimilarity], result of:
            0.07925779 = score(doc=4503,freq=1.0), product of:
              0.17942359 = queryWeight, product of:
                1.1016082 = boost
                7.0677705 = idf(docFreq=98, maxDocs=42740)
                0.023044642 = queryNorm
              0.44173566 = fieldWeight in 4503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0677705 = idf(docFreq=98, maxDocs=42740)
                0.0625 = fieldNorm(doc=4503)
          0.036343135 = weight(abstract_txt:noch in 4503) [ClassicSimilarity], result of:
            0.036343135 = score(doc=4503,freq=1.0), product of:
              0.12213221 = queryWeight, product of:
                1.1131357 = boost
                4.761153 = idf(docFreq=993, maxDocs=42740)
                0.023044642 = queryNorm
              0.29757208 = fieldWeight in 4503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.761153 = idf(docFreq=993, maxDocs=42740)
                0.0625 = fieldNorm(doc=4503)
          0.102890536 = weight(abstract_txt:dann in 4503) [ClassicSimilarity], result of:
            0.102890536 = score(doc=4503,freq=3.0), product of:
              0.16946961 = queryWeight, product of:
                1.3112301 = boost
                5.6084514 = idf(docFreq=425, maxDocs=42740)
                0.023044642 = queryNorm
              0.6071327 = fieldWeight in 4503, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6084514 = idf(docFreq=425, maxDocs=42740)
                0.0625 = fieldNorm(doc=4503)
          0.051949546 = weight(abstract_txt:auch in 4503) [ClassicSimilarity], result of:
            0.051949546 = score(doc=4503,freq=3.0), product of:
              0.1274028 = queryWeight, product of:
                1.4677323 = boost
                3.7667098 = idf(docFreq=2686, maxDocs=42740)
                0.023044642 = queryNorm
              0.4077583 = fieldWeight in 4503, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.7667098 = idf(docFreq=2686, maxDocs=42740)
                0.0625 = fieldNorm(doc=4503)
          0.17999704 = weight(abstract_txt:einmal in 4503) [ClassicSimilarity], result of:
            0.17999704 = score(doc=4503,freq=2.0), product of:
              0.31000006 = queryWeight, product of:
                2.0477798 = boost
                6.5691404 = idf(docFreq=162, maxDocs=42740)
                0.023044642 = queryNorm
              0.5806355 = fieldWeight in 4503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5691404 = idf(docFreq=162, maxDocs=42740)
                0.0625 = fieldNorm(doc=4503)
          0.088114746 = weight(abstract_txt:eine in 4503) [ClassicSimilarity], result of:
            0.088114746 = score(doc=4503,freq=5.0), product of:
              0.1787505 = queryWeight, product of:
                2.1990798 = boost
                3.5272505 = idf(docFreq=3413, maxDocs=42740)
                0.023044642 = queryNorm
              0.49294826 = fieldWeight in 4503, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.5272505 = idf(docFreq=3413, maxDocs=42740)
                0.0625 = fieldNorm(doc=4503)
        0.32 = coord(8/25)
    
  3. Erben, K.M.: ¬Das Internet wird menschlich : Web-Guides sind die neuen Pfadfinder im Dschungel des Netzes (2001) 0.20
    0.19621363 = sum of:
      0.19621363 = product of:
        0.49053407 = sum of:
          0.04343981 = weight(abstract_txt:lange in 651) [ClassicSimilarity], result of:
            0.04343981 = score(doc=651,freq=1.0), product of:
              0.16438295 = queryWeight, product of:
                1.0544251 = boost
                6.765051 = idf(docFreq=133, maxDocs=42740)
                0.023044642 = queryNorm
              0.26425982 = fieldWeight in 651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.765051 = idf(docFreq=133, maxDocs=42740)
                0.0390625 = fieldNorm(doc=651)
          0.030618165 = weight(abstract_txt:haben in 651) [ClassicSimilarity], result of:
            0.030618165 = score(doc=651,freq=2.0), product of:
              0.11828729 = queryWeight, product of:
                1.0954739 = boost
                4.6856093 = idf(docFreq=1071, maxDocs=42740)
                0.023044642 = queryNorm
              0.25884578 = fieldWeight in 651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6856093 = idf(docFreq=1071, maxDocs=42740)
                0.0390625 = fieldNorm(doc=651)
          0.039342597 = weight(abstract_txt:noch in 651) [ClassicSimilarity], result of:
            0.039342597 = score(doc=651,freq=3.0), product of:
              0.12213221 = queryWeight, product of:
                1.1131357 = boost
                4.761153 = idf(docFreq=993, maxDocs=42740)
                0.023044642 = queryNorm
              0.32213122 = fieldWeight in 651, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.761153 = idf(docFreq=993, maxDocs=42740)
                0.0390625 = fieldNorm(doc=651)
          0.05501043 = weight(abstract_txt:eben in 651) [ClassicSimilarity], result of:
            0.05501043 = score(doc=651,freq=1.0), product of:
              0.19241026 = queryWeight, product of:
                1.1407789 = boost
                7.319085 = idf(docFreq=76, maxDocs=42740)
                0.023044642 = queryNorm
              0.28590176 = fieldWeight in 651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.319085 = idf(docFreq=76, maxDocs=42740)
                0.0390625 = fieldNorm(doc=651)
          0.08602547 = weight(abstract_txt:kennen in 651) [ClassicSimilarity], result of:
            0.08602547 = score(doc=651,freq=2.0), product of:
              0.20574987 = queryWeight, product of:
                1.1796608 = boost
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.023044642 = queryNorm
              0.41810703 = fieldWeight in 651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.0390625 = fieldNorm(doc=651)
          0.034846995 = weight(abstract_txt:zeit in 651) [ClassicSimilarity], result of:
            0.034846995 = score(doc=651,freq=1.0), product of:
              0.16245715 = queryWeight, product of:
                1.2838149 = boost
                5.49119 = idf(docFreq=478, maxDocs=42740)
                0.023044642 = queryNorm
              0.21449961 = fieldWeight in 651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.49119 = idf(docFreq=478, maxDocs=42740)
                0.0390625 = fieldNorm(doc=651)
          0.037127424 = weight(abstract_txt:dann in 651) [ClassicSimilarity], result of:
            0.037127424 = score(doc=651,freq=1.0), product of:
              0.16946961 = queryWeight, product of:
                1.3112301 = boost
                5.6084514 = idf(docFreq=425, maxDocs=42740)
                0.023044642 = queryNorm
              0.21908014 = fieldWeight in 651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6084514 = idf(docFreq=425, maxDocs=42740)
                0.0390625 = fieldNorm(doc=651)
          0.04191661 = weight(abstract_txt:auch in 651) [ClassicSimilarity], result of:
            0.04191661 = score(doc=651,freq=5.0), product of:
              0.1274028 = queryWeight, product of:
                1.4677323 = boost
                3.7667098 = idf(docFreq=2686, maxDocs=42740)
                0.023044642 = queryNorm
              0.32900855 = fieldWeight in 651, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.7667098 = idf(docFreq=2686, maxDocs=42740)
                0.0390625 = fieldNorm(doc=651)
          0.0795482 = weight(abstract_txt:einmal in 651) [ClassicSimilarity], result of:
            0.0795482 = score(doc=651,freq=1.0), product of:
              0.31000006 = queryWeight, product of:
                2.0477798 = boost
                6.5691404 = idf(docFreq=162, maxDocs=42740)
                0.023044642 = queryNorm
              0.25660706 = fieldWeight in 651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5691404 = idf(docFreq=162, maxDocs=42740)
                0.0390625 = fieldNorm(doc=651)
          0.042658366 = weight(abstract_txt:eine in 651) [ClassicSimilarity], result of:
            0.042658366 = score(doc=651,freq=3.0), product of:
              0.1787505 = queryWeight, product of:
                2.1990798 = boost
                3.5272505 = idf(docFreq=3413, maxDocs=42740)
                0.023044642 = queryNorm
              0.23864754 = fieldWeight in 651, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5272505 = idf(docFreq=3413, maxDocs=42740)
                0.0390625 = fieldNorm(doc=651)
        0.4 = coord(10/25)
    
  4. Taglinger, H.: Ausgevogelt, jetzt wird es ernst (2018) 0.18
    0.18016656 = sum of:
      0.18016656 = product of:
        0.50046265 = sum of:
          0.05187626 = weight(abstract_txt:verstehen in 282) [ClassicSimilarity], result of:
            0.05187626 = score(doc=282,freq=1.0), product of:
              0.14785135 = queryWeight, product of:
                6.4158664 = idf(docFreq=189, maxDocs=42740)
                0.023044642 = queryNorm
              0.3508677 = fieldWeight in 282, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4158664 = idf(docFreq=189, maxDocs=42740)
                0.0546875 = fieldNorm(doc=282)
          0.022449428 = weight(abstract_txt:also in 282) [ClassicSimilarity], result of:
            0.022449428 = score(doc=282,freq=2.0), product of:
              0.08458948 = queryWeight, product of:
                1.0696964 = boost
                3.4315145 = idf(docFreq=3756, maxDocs=42740)
                0.023044642 = queryNorm
              0.26539266 = fieldWeight in 282, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4315145 = idf(docFreq=3756, maxDocs=42740)
                0.0546875 = fieldNorm(doc=282)
          0.06935057 = weight(abstract_txt:menge in 282) [ClassicSimilarity], result of:
            0.06935057 = score(doc=282,freq=1.0), product of:
              0.17942359 = queryWeight, product of:
                1.1016082 = boost
                7.0677705 = idf(docFreq=98, maxDocs=42740)
                0.023044642 = queryNorm
              0.3865187 = fieldWeight in 282, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0677705 = idf(docFreq=98, maxDocs=42740)
                0.0546875 = fieldNorm(doc=282)
          0.05507964 = weight(abstract_txt:noch in 282) [ClassicSimilarity], result of:
            0.05507964 = score(doc=282,freq=3.0), product of:
              0.12213221 = queryWeight, product of:
                1.1131357 = boost
                4.761153 = idf(docFreq=993, maxDocs=42740)
                0.023044642 = queryNorm
              0.4509837 = fieldWeight in 282, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.761153 = idf(docFreq=993, maxDocs=42740)
                0.0546875 = fieldNorm(doc=282)
          0.07701461 = weight(abstract_txt:eben in 282) [ClassicSimilarity], result of:
            0.07701461 = score(doc=282,freq=1.0), product of:
              0.19241026 = queryWeight, product of:
                1.1407789 = boost
                7.319085 = idf(docFreq=76, maxDocs=42740)
                0.023044642 = queryNorm
              0.40026248 = fieldWeight in 282, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.319085 = idf(docFreq=76, maxDocs=42740)
                0.0546875 = fieldNorm(doc=282)
          0.048785795 = weight(abstract_txt:zeit in 282) [ClassicSimilarity], result of:
            0.048785795 = score(doc=282,freq=1.0), product of:
              0.16245715 = queryWeight, product of:
                1.2838149 = boost
                5.49119 = idf(docFreq=478, maxDocs=42740)
                0.023044642 = queryNorm
              0.30029947 = fieldWeight in 282, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.49119 = idf(docFreq=478, maxDocs=42740)
                0.0546875 = fieldNorm(doc=282)
          0.09002922 = weight(abstract_txt:dann in 282) [ClassicSimilarity], result of:
            0.09002922 = score(doc=282,freq=3.0), product of:
              0.16946961 = queryWeight, product of:
                1.3112301 = boost
                5.6084514 = idf(docFreq=425, maxDocs=42740)
                0.023044642 = queryNorm
              0.53124106 = fieldWeight in 282, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6084514 = idf(docFreq=425, maxDocs=42740)
                0.0546875 = fieldNorm(doc=282)
          0.03711455 = weight(abstract_txt:auch in 282) [ClassicSimilarity], result of:
            0.03711455 = score(doc=282,freq=2.0), product of:
              0.1274028 = queryWeight, product of:
                1.4677323 = boost
                3.7667098 = idf(docFreq=2686, maxDocs=42740)
                0.023044642 = queryNorm
              0.2913166 = fieldWeight in 282, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7667098 = idf(docFreq=2686, maxDocs=42740)
                0.0546875 = fieldNorm(doc=282)
          0.04876258 = weight(abstract_txt:eine in 282) [ClassicSimilarity], result of:
            0.04876258 = score(doc=282,freq=2.0), product of:
              0.1787505 = queryWeight, product of:
                2.1990798 = boost
                3.5272505 = idf(docFreq=3413, maxDocs=42740)
                0.023044642 = queryNorm
              0.27279687 = fieldWeight in 282, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5272505 = idf(docFreq=3413, maxDocs=42740)
                0.0546875 = fieldNorm(doc=282)
        0.36 = coord(9/25)
    
  5. Heyer, G.; Quasthoff, U.; Wittig, T.: Text Mining : Wissensrohstoff Text. Konzepte, Algorithmen, Ergebnisse (2006) 0.17
    0.16786098 = sum of:
      0.16786098 = product of:
        0.5995035 = sum of:
          0.06143316 = weight(abstract_txt:lange in 219) [ClassicSimilarity], result of:
            0.06143316 = score(doc=219,freq=2.0), product of:
              0.16438295 = queryWeight, product of:
                1.0544251 = boost
                6.765051 = idf(docFreq=133, maxDocs=42740)
                0.023044642 = queryNorm
              0.37371978 = fieldWeight in 219, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.765051 = idf(docFreq=133, maxDocs=42740)
                0.0390625 = fieldNorm(doc=219)
          0.07523995 = weight(abstract_txt:technologie in 219) [ClassicSimilarity], result of:
            0.07523995 = score(doc=219,freq=3.0), product of:
              0.16438295 = queryWeight, product of:
                1.0544251 = boost
                6.765051 = idf(docFreq=133, maxDocs=42740)
                0.023044642 = queryNorm
              0.4577114 = fieldWeight in 219, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.765051 = idf(docFreq=133, maxDocs=42740)
                0.0390625 = fieldNorm(doc=219)
          0.032123096 = weight(abstract_txt:noch in 219) [ClassicSimilarity], result of:
            0.032123096 = score(doc=219,freq=2.0), product of:
              0.12213221 = queryWeight, product of:
                1.1131357 = boost
                4.761153 = idf(docFreq=993, maxDocs=42740)
                0.023044642 = queryNorm
              0.26301903 = fieldWeight in 219, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.761153 = idf(docFreq=993, maxDocs=42740)
                0.0390625 = fieldNorm(doc=219)
          0.026510391 = weight(abstract_txt:auch in 219) [ClassicSimilarity], result of:
            0.026510391 = score(doc=219,freq=2.0), product of:
              0.1274028 = queryWeight, product of:
                1.4677323 = boost
                3.7667098 = idf(docFreq=2686, maxDocs=42740)
                0.023044642 = queryNorm
              0.20808327 = fieldWeight in 219, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7667098 = idf(docFreq=2686, maxDocs=42740)
                0.0390625 = fieldNorm(doc=219)
          0.049257644 = weight(abstract_txt:eine in 219) [ClassicSimilarity], result of:
            0.049257644 = score(doc=219,freq=4.0), product of:
              0.1787505 = queryWeight, product of:
                2.1990798 = boost
                3.5272505 = idf(docFreq=3413, maxDocs=42740)
                0.023044642 = queryNorm
              0.27556646 = fieldWeight in 219, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5272505 = idf(docFreq=3413, maxDocs=42740)
                0.0390625 = fieldNorm(doc=219)
          0.117624715 = weight(abstract_txt:text in 219) [ClassicSimilarity], result of:
            0.117624715 = score(doc=219,freq=13.0), product of:
              0.20620799 = queryWeight, product of:
                2.2093987 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.023044642 = queryNorm
              0.5704178 = fieldWeight in 219, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.0390625 = fieldNorm(doc=219)
          0.2373146 = weight(abstract_txt:mining in 219) [ClassicSimilarity], result of:
            0.2373146 = score(doc=219,freq=8.0), product of:
              0.34601733 = queryWeight, product of:
                2.4188352 = boost
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.023044642 = queryNorm
              0.685846 = fieldWeight in 219, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.0390625 = fieldNorm(doc=219)
        0.28 = coord(7/25)