Document (#26508)

Author
Peters, G.
Gaese, V.
Title
¬Das DocCat-System in der Textdokumentation von G+J
Source
Medien-Informationsmanagement: Archivarische, dokumentarische, betriebswirtschaftliche, rechtliche und Berufsbild-Aspekte. Hrsg.: Marianne Englert u.a
Imprint
Münster : LIT Verlag
Year
2003
Pages
S.123-133
Series
Beiträge zur Mediendokumentation; Bd.6
Abstract
Wir werden einmal die Grundlagen des Text-Mining-Systems bei IBM darstellen, dann werden wir das Projekt etwas umfangreicher und deutlicher darstellen, da kennen wir uns aus. Von daher haben wir zwei Teile, einmal Heidelberg, einmal Hamburg. Noch einmal zur Technologie. Text-Mining ist eine von IBM entwickelte Technologie, die in einer besonderen Ausformung und Programmierung für uns zusammengestellt wurde. Das Projekt hieß bei uns lange Zeit DocText Miner und heißt seit einiger Zeit auf Vorschlag von IBM DocCat, das soll eine Abkürzung für Document-Categoriser sein, sie ist ja auch nett und anschaulich. Wir fangen an mit Text-Mining, das bei IBM in Heidelberg entwickelt wurde. Die verstehen darunter das automatische Indexieren als eine Instanz, also einen Teil von Text-Mining. Probleme werden dabei gezeigt, und das Text-Mining ist eben eine Methode zur Strukturierung von und der Suche in großen Dokumentenmengen, die Extraktion von Informationen und, das ist der hohe Anspruch, von impliziten Zusammenhängen. Das letztere sei dahingestellt. IBM macht das quantitativ, empirisch, approximativ und schnell. das muss man wirklich sagen. Das Ziel, und das ist ganz wichtig für unser Projekt gewesen, ist nicht, den Text zu verstehen, sondern das Ergebnis dieser Verfahren ist, was sie auf Neudeutsch a bundle of words, a bag of words nennen, also eine Menge von bedeutungstragenden Begriffen aus einem Text zu extrahieren, aufgrund von Algorithmen, also im Wesentlichen aufgrund von Rechenoperationen. Es gibt eine ganze Menge von linguistischen Vorstudien, ein wenig Linguistik ist auch dabei, aber nicht die Grundlage der ganzen Geschichte. Was sie für uns gemacht haben, ist also die Annotierung von Pressetexten für unsere Pressedatenbank. Für diejenigen, die es noch nicht kennen: Gruner + Jahr führt eine Textdokumentation, die eine Datenbank führt, seit Anfang der 70er Jahre, da sind z.Z. etwa 6,5 Millionen Dokumente darin, davon etwas über 1 Million Volltexte ab 1993. Das Prinzip war lange Zeit, dass wir die Dokumente, die in der Datenbank gespeichert waren und sind, verschlagworten und dieses Prinzip haben wir auch dann, als der Volltext eingeführt wurde, in abgespeckter Form weitergeführt. Zu diesen 6,5 Millionen Dokumenten gehören dann eben auch ungefähr 10 Millionen Faksimileseiten, weil wir die Faksimiles auch noch standardmäßig aufheben.
Theme
Data Mining
Dokumentenmanagement
Object
DocCat

Similar documents (author)

  1. Peters, C.M.: CD-ROM: its potential in libraries (1986) 4.76
    4.7620935 = sum of:
      4.7620935 = weight(author_txt:peters in 535) [ClassicSimilarity], result of:
        4.7620935 = fieldWeight in 535, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.61935 = idf(docFreq=58, maxDocs=44218)
          0.625 = fieldNorm(doc=535)
    
  2. Peters, T.A.: When smart people fail : an analysis of the transaction log of an online public access catalog (1989) 4.76
    4.7620935 = sum of:
      4.7620935 = weight(author_txt:peters in 2283) [ClassicSimilarity], result of:
        4.7620935 = fieldWeight in 2283, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.61935 = idf(docFreq=58, maxDocs=44218)
          0.625 = fieldNorm(doc=2283)
    
  3. Peters, C.M.: CD-ROM and optical technology : the user interface (1988) 4.76
    4.7620935 = sum of:
      4.7620935 = weight(author_txt:peters in 4013) [ClassicSimilarity], result of:
        4.7620935 = fieldWeight in 4013, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.61935 = idf(docFreq=58, maxDocs=44218)
          0.625 = fieldNorm(doc=4013)
    
  4. Peters, B.F.: Online searching using speech as a man / machine interface (1989) 4.76
    4.7620935 = sum of:
      4.7620935 = weight(author_txt:peters in 4637) [ClassicSimilarity], result of:
        4.7620935 = fieldWeight in 4637, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.61935 = idf(docFreq=58, maxDocs=44218)
          0.625 = fieldNorm(doc=4637)
    
  5. Peters, R.: Katalogisierung mit MIDAS (1991) 4.76
    4.7620935 = sum of:
      4.7620935 = weight(author_txt:peters in 4740) [ClassicSimilarity], result of:
        4.7620935 = fieldWeight in 4740, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.61935 = idf(docFreq=58, maxDocs=44218)
          0.625 = fieldNorm(doc=4740)
    

Similar documents (content)

  1. Jörn, F.: Wie Google für uns nach der ominösen Gluonenkraft stöbert : Software-Krabbler machen sich vor der Anfrage auf die Suche - Das Netz ist etwa fünfhundertmal größer als alles Durchforschte (2001) 0.26
    0.2624302 = sum of:
      0.2624302 = product of:
        0.46862534 = sum of:
          0.017721271 = weight(abstract_txt:verstehen in 3684) [ClassicSimilarity], result of:
            0.017721271 = score(doc=3684,freq=1.0), product of:
              0.14373775 = queryWeight, product of:
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.022770725 = queryNorm
              0.123288915 = fieldWeight in 3684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.01913993 = weight(abstract_txt:etwas in 3684) [ClassicSimilarity], result of:
            0.01913993 = score(doc=3684,freq=1.0), product of:
              0.15131007 = queryWeight, product of:
                1.0260026 = boost
                6.4765315 = idf(docFreq=184, maxDocs=44218)
                0.022770725 = queryNorm
              0.12649475 = fieldWeight in 3684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4765315 = idf(docFreq=184, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.021650242 = weight(abstract_txt:lange in 3684) [ClassicSimilarity], result of:
            0.021650242 = score(doc=3684,freq=1.0), product of:
              0.16426666 = queryWeight, product of:
                1.0690285 = boost
                6.7481275 = idf(docFreq=140, maxDocs=44218)
                0.022770725 = queryNorm
              0.13179937 = fieldWeight in 3684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7481275 = idf(docFreq=140, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.0055302694 = weight(abstract_txt:also in 3684) [ClassicSimilarity], result of:
            0.0055302694 = score(doc=3684,freq=1.0), product of:
              0.08331985 = queryWeight, product of:
                1.0767226 = boost
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.022770725 = queryNorm
              0.066373974 = fieldWeight in 3684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.021423506 = weight(abstract_txt:haben in 3684) [ClassicSimilarity], result of:
            0.021423506 = score(doc=3684,freq=4.0), product of:
              0.11762828 = queryWeight, product of:
                1.1079396 = boost
                4.6624994 = idf(docFreq=1134, maxDocs=44218)
                0.022770725 = queryNorm
              0.18212888 = fieldWeight in 3684, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6624994 = idf(docFreq=1134, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.031725697 = weight(abstract_txt:noch in 3684) [ClassicSimilarity], result of:
            0.031725697 = score(doc=3684,freq=8.0), product of:
              0.12129665 = queryWeight, product of:
                1.1250831 = boost
                4.734644 = idf(docFreq=1055, maxDocs=44218)
                0.022770725 = queryNorm
              0.2615546 = fieldWeight in 3684, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.734644 = idf(docFreq=1055, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.011479217 = weight(abstract_txt:wurde in 3684) [ClassicSimilarity], result of:
            0.011479217 = score(doc=3684,freq=1.0), product of:
              0.1231817 = queryWeight, product of:
                1.1337918 = boost
                4.771292 = idf(docFreq=1017, maxDocs=44218)
                0.022770725 = queryNorm
              0.0931893 = fieldWeight in 3684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.771292 = idf(docFreq=1017, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.030095333 = weight(abstract_txt:zeit in 3684) [ClassicSimilarity], result of:
            0.030095333 = score(doc=3684,freq=3.0), product of:
              0.16239166 = queryWeight, product of:
                1.3017935 = boost
                5.478287 = idf(docFreq=501, maxDocs=44218)
                0.022770725 = queryNorm
              0.18532561 = fieldWeight in 3684, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.478287 = idf(docFreq=501, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.05252432 = weight(abstract_txt:dann in 3684) [ClassicSimilarity], result of:
            0.05252432 = score(doc=3684,freq=8.0), product of:
              0.16975205 = queryWeight, product of:
                1.3309684 = boost
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.022770725 = queryNorm
              0.3094179 = fieldWeight in 3684, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.02442139 = weight(abstract_txt:auch in 3684) [ClassicSimilarity], result of:
            0.02442139 = score(doc=3684,freq=7.0), product of:
              0.12628987 = queryWeight, product of:
                1.4820704 = boost
                3.742164 = idf(docFreq=2848, maxDocs=44218)
                0.022770725 = queryNorm
              0.19337568 = fieldWeight in 3684, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.742164 = idf(docFreq=2848, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.092182264 = weight(abstract_txt:millionen in 3684) [ClassicSimilarity], result of:
            0.092182264 = score(doc=3684,freq=10.0), product of:
              0.2292818 = queryWeight, product of:
                1.5468385 = boost
                6.5095015 = idf(docFreq=178, maxDocs=44218)
                0.022770725 = queryNorm
              0.4020479 = fieldWeight in 3684, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                6.5095015 = idf(docFreq=178, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.079812646 = weight(abstract_txt:einmal in 3684) [ClassicSimilarity], result of:
            0.079812646 = score(doc=3684,freq=4.0), product of:
              0.31113252 = queryWeight, product of:
                2.0806656 = boost
                6.5669885 = idf(docFreq=168, maxDocs=44218)
                0.022770725 = queryNorm
              0.25652298 = fieldWeight in 3684, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.5669885 = idf(docFreq=168, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.03785779 = weight(abstract_txt:eine in 3684) [ClassicSimilarity], result of:
            0.03785779 = score(doc=3684,freq=10.0), product of:
              0.17567007 = queryWeight, product of:
                2.211024 = boost
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.022770725 = queryNorm
              0.21550506 = fieldWeight in 3684, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.023061441 = weight(abstract_txt:text in 3684) [ClassicSimilarity], result of:
            0.023061441 = score(doc=3684,freq=2.0), product of:
              0.20646411 = queryWeight, product of:
                2.242182 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.022770725 = queryNorm
              0.11169709 = fieldWeight in 3684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
        0.56 = coord(14/25)
    
  2. Arns, C.: Fallstricke Online : Über die eigenen Worte gestolpert (2005) 0.20
    0.20146713 = sum of:
      0.20146713 = product of:
        0.6295848 = sum of:
          0.061247777 = weight(abstract_txt:etwas in 3502) [ClassicSimilarity], result of:
            0.061247777 = score(doc=3502,freq=1.0), product of:
              0.15131007 = queryWeight, product of:
                1.0260026 = boost
                6.4765315 = idf(docFreq=184, maxDocs=44218)
                0.022770725 = queryNorm
              0.40478322 = fieldWeight in 3502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4765315 = idf(docFreq=184, maxDocs=44218)
                0.0625 = fieldNorm(doc=3502)
          0.03427761 = weight(abstract_txt:haben in 3502) [ClassicSimilarity], result of:
            0.03427761 = score(doc=3502,freq=1.0), product of:
              0.11762828 = queryWeight, product of:
                1.1079396 = boost
                4.6624994 = idf(docFreq=1134, maxDocs=44218)
                0.022770725 = queryNorm
              0.2914062 = fieldWeight in 3502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6624994 = idf(docFreq=1134, maxDocs=44218)
                0.0625 = fieldNorm(doc=3502)
          0.07782169 = weight(abstract_txt:menge in 3502) [ClassicSimilarity], result of:
            0.07782169 = score(doc=3502,freq=1.0), product of:
              0.17750396 = queryWeight, product of:
                1.1112674 = boost
                7.014756 = idf(docFreq=107, maxDocs=44218)
                0.022770725 = queryNorm
              0.43842226 = fieldWeight in 3502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.014756 = idf(docFreq=107, maxDocs=44218)
                0.0625 = fieldNorm(doc=3502)
          0.03589353 = weight(abstract_txt:noch in 3502) [ClassicSimilarity], result of:
            0.03589353 = score(doc=3502,freq=1.0), product of:
              0.12129665 = queryWeight, product of:
                1.1250831 = boost
                4.734644 = idf(docFreq=1055, maxDocs=44218)
                0.022770725 = queryNorm
              0.29591525 = fieldWeight in 3502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.734644 = idf(docFreq=1055, maxDocs=44218)
                0.0625 = fieldNorm(doc=3502)
          0.10292624 = weight(abstract_txt:dann in 3502) [ClassicSimilarity], result of:
            0.10292624 = score(doc=3502,freq=3.0), product of:
              0.16975205 = queryWeight, product of:
                1.3309684 = boost
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.022770725 = queryNorm
              0.60633284 = fieldWeight in 3502, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.0625 = fieldNorm(doc=3502)
          0.051160168 = weight(abstract_txt:auch in 3502) [ClassicSimilarity], result of:
            0.051160168 = score(doc=3502,freq=3.0), product of:
              0.12628987 = queryWeight, product of:
                1.4820704 = boost
                3.742164 = idf(docFreq=2848, maxDocs=44218)
                0.022770725 = queryNorm
              0.40510112 = fieldWeight in 3502, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.742164 = idf(docFreq=2848, maxDocs=44218)
                0.0625 = fieldNorm(doc=3502)
          0.18059538 = weight(abstract_txt:einmal in 3502) [ClassicSimilarity], result of:
            0.18059538 = score(doc=3502,freq=2.0), product of:
              0.31113252 = queryWeight, product of:
                2.0806656 = boost
                6.5669885 = idf(docFreq=168, maxDocs=44218)
                0.022770725 = queryNorm
              0.58044523 = fieldWeight in 3502, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5669885 = idf(docFreq=168, maxDocs=44218)
                0.0625 = fieldNorm(doc=3502)
          0.0856624 = weight(abstract_txt:eine in 3502) [ClassicSimilarity], result of:
            0.0856624 = score(doc=3502,freq=5.0), product of:
              0.17567007 = queryWeight, product of:
                2.211024 = boost
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.022770725 = queryNorm
              0.4876323 = fieldWeight in 3502, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.0625 = fieldNorm(doc=3502)
        0.32 = coord(8/25)
    
  3. Erben, K.M.: ¬Das Internet wird menschlich : Web-Guides sind die neuen Pfadfinder im Dschungel des Netzes (2001) 0.19
    0.19498572 = sum of:
      0.19498572 = product of:
        0.48746428 = sum of:
          0.043300483 = weight(abstract_txt:lange in 5735) [ClassicSimilarity], result of:
            0.043300483 = score(doc=5735,freq=1.0), product of:
              0.16426666 = queryWeight, product of:
                1.0690285 = boost
                6.7481275 = idf(docFreq=140, maxDocs=44218)
                0.022770725 = queryNorm
              0.26359874 = fieldWeight in 5735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7481275 = idf(docFreq=140, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5735)
          0.030297413 = weight(abstract_txt:haben in 5735) [ClassicSimilarity], result of:
            0.030297413 = score(doc=5735,freq=2.0), product of:
              0.11762828 = queryWeight, product of:
                1.1079396 = boost
                4.6624994 = idf(docFreq=1134, maxDocs=44218)
                0.022770725 = queryNorm
              0.25756913 = fieldWeight in 5735, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6624994 = idf(docFreq=1134, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5735)
          0.038855884 = weight(abstract_txt:noch in 5735) [ClassicSimilarity], result of:
            0.038855884 = score(doc=5735,freq=3.0), product of:
              0.12129665 = queryWeight, product of:
                1.1250831 = boost
                4.734644 = idf(docFreq=1055, maxDocs=44218)
                0.022770725 = queryNorm
              0.32033765 = fieldWeight in 5735, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.734644 = idf(docFreq=1055, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5735)
          0.055726632 = weight(abstract_txt:eben in 5735) [ClassicSimilarity], result of:
            0.055726632 = score(doc=5735,freq=1.0), product of:
              0.19435519 = queryWeight, product of:
                1.1628205 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.022770725 = queryNorm
              0.28672573 = fieldWeight in 5735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5735)
          0.08482903 = weight(abstract_txt:kennen in 5735) [ClassicSimilarity], result of:
            0.08482903 = score(doc=5735,freq=2.0), product of:
              0.20413022 = queryWeight, product of:
                1.1917036 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.022770725 = queryNorm
              0.41556332 = fieldWeight in 5735, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5735)
          0.0347511 = weight(abstract_txt:zeit in 5735) [ClassicSimilarity], result of:
            0.0347511 = score(doc=5735,freq=1.0), product of:
              0.16239166 = queryWeight, product of:
                1.3017935 = boost
                5.478287 = idf(docFreq=501, maxDocs=44218)
                0.022770725 = queryNorm
              0.21399559 = fieldWeight in 5735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.478287 = idf(docFreq=501, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5735)
          0.037140306 = weight(abstract_txt:dann in 5735) [ClassicSimilarity], result of:
            0.037140306 = score(doc=5735,freq=1.0), product of:
              0.16975205 = queryWeight, product of:
                1.3309684 = boost
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.022770725 = queryNorm
              0.21879151 = fieldWeight in 5735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5735)
          0.041279685 = weight(abstract_txt:auch in 5735) [ClassicSimilarity], result of:
            0.041279685 = score(doc=5735,freq=5.0), product of:
              0.12628987 = queryWeight, product of:
                1.4820704 = boost
                3.742164 = idf(docFreq=2848, maxDocs=44218)
                0.022770725 = queryNorm
              0.32686457 = fieldWeight in 5735, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.742164 = idf(docFreq=2848, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5735)
          0.079812646 = weight(abstract_txt:einmal in 5735) [ClassicSimilarity], result of:
            0.079812646 = score(doc=5735,freq=1.0), product of:
              0.31113252 = queryWeight, product of:
                2.0806656 = boost
                6.5669885 = idf(docFreq=168, maxDocs=44218)
                0.022770725 = queryNorm
              0.25652298 = fieldWeight in 5735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5669885 = idf(docFreq=168, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5735)
          0.04147113 = weight(abstract_txt:eine in 5735) [ClassicSimilarity], result of:
            0.04147113 = score(doc=5735,freq=3.0), product of:
              0.17567007 = queryWeight, product of:
                2.211024 = boost
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.022770725 = queryNorm
              0.23607397 = fieldWeight in 5735, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5735)
        0.4 = coord(10/25)
    
  4. Taglinger, H.: Ausgevogelt, jetzt wird es ernst (2018) 0.18
    0.17809053 = sum of:
      0.17809053 = product of:
        0.49469587 = sum of:
          0.049619555 = weight(abstract_txt:verstehen in 4281) [ClassicSimilarity], result of:
            0.049619555 = score(doc=4281,freq=1.0), product of:
              0.14373775 = queryWeight, product of:
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.022770725 = queryNorm
              0.34520894 = fieldWeight in 4281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4281)
          0.021898752 = weight(abstract_txt:also in 4281) [ClassicSimilarity], result of:
            0.021898752 = score(doc=4281,freq=2.0), product of:
              0.08331985 = queryWeight, product of:
                1.0767226 = boost
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.022770725 = queryNorm
              0.26282755 = fieldWeight in 4281, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4281)
          0.06809398 = weight(abstract_txt:menge in 4281) [ClassicSimilarity], result of:
            0.06809398 = score(doc=4281,freq=1.0), product of:
              0.17750396 = queryWeight, product of:
                1.1112674 = boost
                7.014756 = idf(docFreq=107, maxDocs=44218)
                0.022770725 = queryNorm
              0.3836195 = fieldWeight in 4281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.014756 = idf(docFreq=107, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4281)
          0.054398235 = weight(abstract_txt:noch in 4281) [ClassicSimilarity], result of:
            0.054398235 = score(doc=4281,freq=3.0), product of:
              0.12129665 = queryWeight, product of:
                1.1250831 = boost
                4.734644 = idf(docFreq=1055, maxDocs=44218)
                0.022770725 = queryNorm
              0.44847268 = fieldWeight in 4281, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.734644 = idf(docFreq=1055, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4281)
          0.07801729 = weight(abstract_txt:eben in 4281) [ClassicSimilarity], result of:
            0.07801729 = score(doc=4281,freq=1.0), product of:
              0.19435519 = queryWeight, product of:
                1.1628205 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.022770725 = queryNorm
              0.401416 = fieldWeight in 4281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4281)
          0.048651543 = weight(abstract_txt:zeit in 4281) [ClassicSimilarity], result of:
            0.048651543 = score(doc=4281,freq=1.0), product of:
              0.16239166 = queryWeight, product of:
                1.3017935 = boost
                5.478287 = idf(docFreq=501, maxDocs=44218)
                0.022770725 = queryNorm
              0.29959384 = fieldWeight in 4281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.478287 = idf(docFreq=501, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4281)
          0.090060465 = weight(abstract_txt:dann in 4281) [ClassicSimilarity], result of:
            0.090060465 = score(doc=4281,freq=3.0), product of:
              0.16975205 = queryWeight, product of:
                1.3309684 = boost
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.022770725 = queryNorm
              0.53054124 = fieldWeight in 4281, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4281)
          0.03655059 = weight(abstract_txt:auch in 4281) [ClassicSimilarity], result of:
            0.03655059 = score(doc=4281,freq=2.0), product of:
              0.12628987 = queryWeight, product of:
                1.4820704 = boost
                3.742164 = idf(docFreq=2848, maxDocs=44218)
                0.022770725 = queryNorm
              0.28941822 = fieldWeight in 4281, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.742164 = idf(docFreq=2848, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4281)
          0.04740545 = weight(abstract_txt:eine in 4281) [ClassicSimilarity], result of:
            0.04740545 = score(doc=4281,freq=2.0), product of:
              0.17567007 = queryWeight, product of:
                2.211024 = boost
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.022770725 = queryNorm
              0.26985502 = fieldWeight in 4281, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4281)
        0.36 = coord(9/25)
    
  5. Heyer, G.; Quasthoff, U.; Wittig, T.: Text Mining : Wissensrohstoff Text. Konzepte, Algorithmen, Ergebnisse (2006) 0.17
    0.16624476 = sum of:
      0.16624476 = product of:
        0.5937313 = sum of:
          0.074530005 = weight(abstract_txt:technologie in 5218) [ClassicSimilarity], result of:
            0.074530005 = score(doc=5218,freq=3.0), product of:
              0.16358167 = queryWeight, product of:
                1.0667973 = boost
                6.7340426 = idf(docFreq=142, maxDocs=44218)
                0.022770725 = queryNorm
              0.45561343 = fieldWeight in 5218, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7340426 = idf(docFreq=142, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5218)
          0.06123613 = weight(abstract_txt:lange in 5218) [ClassicSimilarity], result of:
            0.06123613 = score(doc=5218,freq=2.0), product of:
              0.16426666 = queryWeight, product of:
                1.0690285 = boost
                6.7481275 = idf(docFreq=140, maxDocs=44218)
                0.022770725 = queryNorm
              0.37278488 = fieldWeight in 5218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7481275 = idf(docFreq=140, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5218)
          0.031725697 = weight(abstract_txt:noch in 5218) [ClassicSimilarity], result of:
            0.031725697 = score(doc=5218,freq=2.0), product of:
              0.12129665 = queryWeight, product of:
                1.1250831 = boost
                4.734644 = idf(docFreq=1055, maxDocs=44218)
                0.022770725 = queryNorm
              0.2615546 = fieldWeight in 5218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.734644 = idf(docFreq=1055, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5218)
          0.026107565 = weight(abstract_txt:auch in 5218) [ClassicSimilarity], result of:
            0.026107565 = score(doc=5218,freq=2.0), product of:
              0.12628987 = queryWeight, product of:
                1.4820704 = boost
                3.742164 = idf(docFreq=2848, maxDocs=44218)
                0.022770725 = queryNorm
              0.2067273 = fieldWeight in 5218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.742164 = idf(docFreq=2848, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5218)
          0.04788674 = weight(abstract_txt:eine in 5218) [ClassicSimilarity], result of:
            0.04788674 = score(doc=5218,freq=4.0), product of:
              0.17567007 = queryWeight, product of:
                2.211024 = boost
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.022770725 = queryNorm
              0.27259475 = fieldWeight in 5218, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5218)
          0.11759073 = weight(abstract_txt:text in 5218) [ClassicSimilarity], result of:
            0.11759073 = score(doc=5218,freq=13.0), product of:
              0.20646411 = queryWeight, product of:
                2.242182 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.022770725 = queryNorm
              0.5695456 = fieldWeight in 5218, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5218)
          0.23465446 = weight(abstract_txt:mining in 5218) [ClassicSimilarity], result of:
            0.23465446 = score(doc=5218,freq=8.0), product of:
              0.34391952 = queryWeight, product of:
                2.4457552 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.022770725 = queryNorm
              0.68229467 = fieldWeight in 5218, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5218)
        0.28 = coord(7/25)