Document (#31988)

Author
Halip, I.
Title
Automatische Extrahierung von Schlagworten aus unstrukturierten Texten
Imprint
Münster : Institut für Wirtschaftsinformatik der Westfälischen Wilhelms-Universität Münster
Year
2005
Pages
24 pp.
Abstract
Increasing mediatization and digitization confront modern society ever more with the problem of information overload. Surprisingly, the growth in information leads at the same time to a lack of knowledge. The explanation is that a large share of the existing information cannot be found. This is mostly information based on semi-structured and unstructured data. According to estimates, around 80% of the decision-relevant information in companies today exists in unstructured, i.e. mostly textual, form. Because machines are unable to understand the content of unstructured texts, documented knowledge is hard to find and often remains undiscovered. The volume of information is usually too large to be read, understood, or even used, which leads to the following problem: information that cannot be converted into knowledge remains locked away in data repositories as paper-based or digital documents. Given the quantity of documents produced today, manual assignment of keywords no longer seems realistic. Knowledge management therefore develops supporting methods that make information available at the right time, in the right quality, and to the right people. Current research focuses include models for representing documents, methods for determining the similarity of queries to documents and for indexing document collections, and automatic classification. Against this background, this paper concentrates on the different methods of automatic indexing and highlights their algorithmic advantages and disadvantages, with the aim of analyzing how they work on unstructured texts. To this end, Chapter 3 examines and presents automatic indexing methods in more detail. Before that, Chapter 2 explains, classifies, and delimits the basic terminology. Finally, building on the theoretical discussion, implementations of the presented methods are briefly described. The paper ends with conclusions and an outlook.
Content
Paper written for the seminar "Suchmaschinen und Suchalgorithmen" (Search Engines and Search Algorithms), Institut für Wirtschaftsinformatik, Praktische Informatik in der Wirtschaft, Westfälische Wilhelms-Universität Münster
Theme
Automatic indexing
Object
IDX

Similar documents (content)

  1. Loonus, Y.: Einsatzbereiche der KI und ihre Relevanz für Information Professionals (2017) 0.28
    0.28270137 = sum of:
      0.28270137 = product of:
        0.8834418 = sum of:
          0.03353741 = weight(abstract_txt:mehr in 1669) [ClassicSimilarity], result of:
            0.03353741 = score(doc=1669,freq=1.0), product of:
              0.0881986 = queryWeight, product of:
                1.0017896 = boost
                4.8671846 = idf(docFreq=893, maxDocs=42740)
                0.0180887 = queryNorm
              0.38024879 = fieldWeight in 1669, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8671846 = idf(docFreq=893, maxDocs=42740)
                0.078125 = fieldNorm(doc=1669)
          0.027334647 = weight(abstract_txt:nicht in 1669) [ClassicSimilarity], result of:
            0.027334647 = score(doc=1669,freq=1.0), product of:
              0.08809435 = queryWeight, product of:
                1.2262114 = boost
                3.9716904 = idf(docFreq=2188, maxDocs=42740)
                0.0180887 = queryNorm
              0.3102883 = fieldWeight in 1669, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9716904 = idf(docFreq=2188, maxDocs=42740)
                0.078125 = fieldNorm(doc=1669)
          0.07278607 = weight(abstract_txt:führt in 1669) [ClassicSimilarity], result of:
            0.07278607 = score(doc=1669,freq=1.0), product of:
              0.1478453 = queryWeight, product of:
                1.2970282 = boost
                6.3015985 = idf(docFreq=212, maxDocs=42740)
                0.0180887 = queryNorm
              0.49231237 = fieldWeight in 1669, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3015985 = idf(docFreq=212, maxDocs=42740)
                0.078125 = fieldNorm(doc=1669)
          0.09351949 = weight(abstract_txt:texte in 1669) [ClassicSimilarity], result of:
            0.09351949 = score(doc=1669,freq=1.0), product of:
              0.17473371 = queryWeight, product of:
                1.4100484 = boost
                6.850706 = idf(docFreq=122, maxDocs=42740)
                0.0180887 = queryNorm
              0.53521144 = fieldWeight in 1669, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.850706 = idf(docFreq=122, maxDocs=42740)
                0.078125 = fieldNorm(doc=1669)
          0.19275877 = weight(abstract_txt:unstrukturierten in 1669) [ClassicSimilarity], result of:
            0.19275877 = score(doc=1669,freq=1.0), product of:
              0.2829989 = queryWeight, product of:
                1.7944776 = boost
                8.7184515 = idf(docFreq=18, maxDocs=42740)
                0.0180887 = queryNorm
              0.68112904 = fieldWeight in 1669, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.7184515 = idf(docFreq=18, maxDocs=42740)
                0.078125 = fieldNorm(doc=1669)
          0.11756005 = weight(abstract_txt:dokumenten in 1669) [ClassicSimilarity], result of:
            0.11756005 = score(doc=1669,freq=1.0), product of:
              0.23297659 = queryWeight, product of:
                1.9941022 = boost
                6.458884 = idf(docFreq=181, maxDocs=42740)
                0.0180887 = queryNorm
              0.5046003 = fieldWeight in 1669, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.458884 = idf(docFreq=181, maxDocs=42740)
                0.078125 = fieldNorm(doc=1669)
          0.27976725 = weight(abstract_txt:unstrukturierter in 1669) [ClassicSimilarity], result of:
            0.27976725 = score(doc=1669,freq=1.0), product of:
              0.36277714 = queryWeight, product of:
                2.0317283 = boost
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.0180887 = queryNorm
              0.7711821 = fieldWeight in 1669, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.078125 = fieldNorm(doc=1669)
          0.066178076 = weight(abstract_txt:werden in 1669) [ClassicSimilarity], result of:
            0.066178076 = score(doc=1669,freq=3.0), product of:
              0.13875589 = queryWeight, product of:
                2.1763663 = boost
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.0180887 = queryNorm
              0.47693884 = fieldWeight in 1669, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.078125 = fieldNorm(doc=1669)
        0.32 = coord(8/25)
    
  2. Kaiser, A.: Computer-unterstütztes Indexieren in Intelligenten Information Retrieval Systemen : Ein Relevanz-Feedback orientierter Ansatz zur Informationserschließung in unformatierten Datenbanken (1993) 0.27
    0.26628205 = sum of:
      0.26628205 = product of:
        0.55475426 = sum of:
          0.016146852 = weight(abstract_txt:sich in 1285) [ClassicSimilarity], result of:
            0.016146852 = score(doc=1285,freq=4.0), product of:
              0.071967706 = queryWeight, product of:
                1.1083071 = boost
                3.5897994 = idf(docFreq=3206, maxDocs=42740)
                0.0180887 = queryNorm
              0.22436246 = fieldWeight in 1285, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5897994 = idf(docFreq=3206, maxDocs=42740)
                0.03125 = fieldNorm(doc=1285)
          0.018754419 = weight(abstract_txt:heute in 1285) [ClassicSimilarity], result of:
            0.018754419 = score(doc=1285,freq=1.0), product of:
              0.11027348 = queryWeight, product of:
                1.120163 = boost
                5.4423003 = idf(docFreq=502, maxDocs=42740)
                0.0180887 = queryNorm
              0.17007188 = fieldWeight in 1285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4423003 = idf(docFreq=502, maxDocs=42740)
                0.03125 = fieldNorm(doc=1285)
          0.021284088 = weight(abstract_txt:wird in 1285) [ClassicSimilarity], result of:
            0.021284088 = score(doc=1285,freq=5.0), product of:
              0.08031792 = queryWeight, product of:
                1.1708399 = boost
                3.7923427 = idf(docFreq=2618, maxDocs=42740)
                0.0180887 = queryNorm
              0.264998 = fieldWeight in 1285, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.7923427 = idf(docFreq=2618, maxDocs=42740)
                0.03125 = fieldNorm(doc=1285)
          0.021867719 = weight(abstract_txt:nicht in 1285) [ClassicSimilarity], result of:
            0.021867719 = score(doc=1285,freq=4.0), product of:
              0.08809435 = queryWeight, product of:
                1.2262114 = boost
                3.9716904 = idf(docFreq=2188, maxDocs=42740)
                0.0180887 = queryNorm
              0.24823065 = fieldWeight in 1285, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9716904 = idf(docFreq=2188, maxDocs=42740)
                0.03125 = fieldNorm(doc=1285)
          0.08765738 = weight(abstract_txt:indexierung in 1285) [ClassicSimilarity], result of:
            0.08765738 = score(doc=1285,freq=6.0), product of:
              0.1696461 = queryWeight, product of:
                1.389369 = boost
                6.7502356 = idf(docFreq=135, maxDocs=42740)
                0.0180887 = queryNorm
              0.5167073 = fieldWeight in 1285, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.7502356 = idf(docFreq=135, maxDocs=42740)
                0.03125 = fieldNorm(doc=1285)
          0.03881413 = weight(abstract_txt:automatische in 1285) [ClassicSimilarity], result of:
            0.03881413 = score(doc=1285,freq=1.0), product of:
              0.17908612 = queryWeight, product of:
                1.4275017 = boost
                6.9355025 = idf(docFreq=112, maxDocs=42740)
                0.0180887 = queryNorm
              0.21673445 = fieldWeight in 1285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9355025 = idf(docFreq=112, maxDocs=42740)
                0.03125 = fieldNorm(doc=1285)
          0.033589233 = weight(abstract_txt:wissen in 1285) [ClassicSimilarity], result of:
            0.033589233 = score(doc=1285,freq=2.0), product of:
              0.14775963 = queryWeight, product of:
                1.5880684 = boost
                5.143743 = idf(docFreq=677, maxDocs=42740)
                0.0180887 = queryNorm
              0.22732347 = fieldWeight in 1285, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.143743 = idf(docFreq=677, maxDocs=42740)
                0.03125 = fieldNorm(doc=1285)
          0.05548386 = weight(abstract_txt:konfrontiert in 1285) [ClassicSimilarity], result of:
            0.05548386 = score(doc=1285,freq=1.0), product of:
              0.22725482 = queryWeight, product of:
                1.6080599 = boost
                7.8127427 = idf(docFreq=46, maxDocs=42740)
                0.0180887 = queryNorm
              0.24414821 = fieldWeight in 1285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8127427 = idf(docFreq=46, maxDocs=42740)
                0.03125 = fieldNorm(doc=1285)
          0.03359545 = weight(abstract_txt:verfahren in 1285) [ClassicSimilarity], result of:
            0.03359545 = score(doc=1285,freq=1.0), product of:
              0.18618844 = queryWeight, product of:
                1.7826564 = boost
                5.7740126 = idf(docFreq=360, maxDocs=42740)
                0.0180887 = queryNorm
              0.1804379 = fieldWeight in 1285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7740126 = idf(docFreq=360, maxDocs=42740)
                0.03125 = fieldNorm(doc=1285)
          0.094048046 = weight(abstract_txt:dokumenten in 1285) [ClassicSimilarity], result of:
            0.094048046 = score(doc=1285,freq=4.0), product of:
              0.23297659 = queryWeight, product of:
                1.9941022 = boost
                6.458884 = idf(docFreq=181, maxDocs=42740)
                0.0180887 = queryNorm
              0.40368024 = fieldWeight in 1285, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.458884 = idf(docFreq=181, maxDocs=42740)
                0.03125 = fieldNorm(doc=1285)
          0.03743597 = weight(abstract_txt:werden in 1285) [ClassicSimilarity], result of:
            0.03743597 = score(doc=1285,freq=6.0), product of:
              0.13875589 = queryWeight, product of:
                2.1763663 = boost
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.0180887 = queryNorm
              0.26979735 = fieldWeight in 1285, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.03125 = fieldNorm(doc=1285)
          0.096077144 = weight(abstract_txt:informationen in 1285) [ClassicSimilarity], result of:
            0.096077144 = score(doc=1285,freq=5.0), product of:
              0.27639645 = queryWeight, product of:
                3.0716555 = boost
                4.974536 = idf(docFreq=802, maxDocs=42740)
                0.0180887 = queryNorm
              0.34760627 = fieldWeight in 1285, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.974536 = idf(docFreq=802, maxDocs=42740)
                0.03125 = fieldNorm(doc=1285)
        0.48 = coord(12/25)
    
  3. Nohr, H.: Automatische Indexierung : Einführung in betriebliche Verfahren, Systeme und Anwendungen (2001) 0.26
    0.25778 = sum of:
      0.25778 = product of:
        0.80556244 = sum of:
          0.02682993 = weight(abstract_txt:mehr in 3544) [ClassicSimilarity], result of:
            0.02682993 = score(doc=3544,freq=1.0), product of:
              0.0881986 = queryWeight, product of:
                1.0017896 = boost
                4.8671846 = idf(docFreq=893, maxDocs=42740)
                0.0180887 = queryNorm
              0.30419904 = fieldWeight in 3544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8671846 = idf(docFreq=893, maxDocs=42740)
                0.0625 = fieldNorm(doc=3544)
          0.016146852 = weight(abstract_txt:sich in 3544) [ClassicSimilarity], result of:
            0.016146852 = score(doc=3544,freq=1.0), product of:
              0.071967706 = queryWeight, product of:
                1.1083071 = boost
                3.5897994 = idf(docFreq=3206, maxDocs=42740)
                0.0180887 = queryNorm
              0.22436246 = fieldWeight in 3544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5897994 = idf(docFreq=3206, maxDocs=42740)
                0.0625 = fieldNorm(doc=3544)
          0.14314389 = weight(abstract_txt:indexierung in 3544) [ClassicSimilarity], result of:
            0.14314389 = score(doc=3544,freq=4.0), product of:
              0.1696461 = queryWeight, product of:
                1.389369 = boost
                6.7502356 = idf(docFreq=135, maxDocs=42740)
                0.0180887 = queryNorm
              0.84377944 = fieldWeight in 3544, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.7502356 = idf(docFreq=135, maxDocs=42740)
                0.0625 = fieldNorm(doc=3544)
          0.15024343 = weight(abstract_txt:verfahren in 3544) [ClassicSimilarity], result of:
            0.15024343 = score(doc=3544,freq=5.0), product of:
              0.18618844 = queryWeight, product of:
                1.7826564 = boost
                5.7740126 = idf(docFreq=360, maxDocs=42740)
                0.0180887 = queryNorm
              0.8069428 = fieldWeight in 3544, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.7740126 = idf(docFreq=360, maxDocs=42740)
                0.0625 = fieldNorm(doc=3544)
          0.154207 = weight(abstract_txt:unstrukturierten in 3544) [ClassicSimilarity], result of:
            0.154207 = score(doc=3544,freq=1.0), product of:
              0.2829989 = queryWeight, product of:
                1.7944776 = boost
                8.7184515 = idf(docFreq=18, maxDocs=42740)
                0.0180887 = queryNorm
              0.5449032 = fieldWeight in 3544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.7184515 = idf(docFreq=18, maxDocs=42740)
                0.0625 = fieldNorm(doc=3544)
          0.16289599 = weight(abstract_txt:dokumenten in 3544) [ClassicSimilarity], result of:
            0.16289599 = score(doc=3544,freq=3.0), product of:
              0.23297659 = queryWeight, product of:
                1.9941022 = boost
                6.458884 = idf(docFreq=181, maxDocs=42740)
                0.0180887 = queryNorm
              0.69919467 = fieldWeight in 3544, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.458884 = idf(docFreq=181, maxDocs=42740)
                0.0625 = fieldNorm(doc=3544)
          0.030566342 = weight(abstract_txt:werden in 3544) [ClassicSimilarity], result of:
            0.030566342 = score(doc=3544,freq=1.0), product of:
              0.13875589 = queryWeight, product of:
                2.1763663 = boost
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.0180887 = queryNorm
              0.22028862 = fieldWeight in 3544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.0625 = fieldNorm(doc=3544)
          0.121529035 = weight(abstract_txt:informationen in 3544) [ClassicSimilarity], result of:
            0.121529035 = score(doc=3544,freq=2.0), product of:
              0.27639645 = queryWeight, product of:
                3.0716555 = boost
                4.974536 = idf(docFreq=802, maxDocs=42740)
                0.0180887 = queryNorm
              0.439691 = fieldWeight in 3544, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.974536 = idf(docFreq=802, maxDocs=42740)
                0.0625 = fieldNorm(doc=3544)
        0.32 = coord(8/25)
    
  4. Helmbrecht-Schaar, A.: Entwicklung eines Verfahrens der automatischen Klassifizierung für Textdokumente aus dem Fachbereich Informatik mithilfe eines fachspezifischen Klassifikationssystems (2007) 0.20
    0.20426203 = sum of:
      0.20426203 = product of:
        0.72950727 = sum of:
          0.04759267 = weight(abstract_txt:wird in 3411) [ClassicSimilarity], result of:
            0.04759267 = score(doc=3411,freq=4.0), product of:
              0.08031792 = queryWeight, product of:
                1.1708399 = boost
                3.7923427 = idf(docFreq=2618, maxDocs=42740)
                0.0180887 = queryNorm
              0.59255356 = fieldWeight in 3411, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.7923427 = idf(docFreq=2618, maxDocs=42740)
                0.078125 = fieldNorm(doc=3411)
          0.09351949 = weight(abstract_txt:texte in 3411) [ClassicSimilarity], result of:
            0.09351949 = score(doc=3411,freq=1.0), product of:
              0.17473371 = queryWeight, product of:
                1.4100484 = boost
                6.850706 = idf(docFreq=122, maxDocs=42740)
                0.0180887 = queryNorm
              0.53521144 = fieldWeight in 3411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.850706 = idf(docFreq=122, maxDocs=42740)
                0.078125 = fieldNorm(doc=3411)
          0.09703533 = weight(abstract_txt:automatische in 3411) [ClassicSimilarity], result of:
            0.09703533 = score(doc=3411,freq=1.0), product of:
              0.17908612 = queryWeight, product of:
                1.4275017 = boost
                6.9355025 = idf(docFreq=112, maxDocs=42740)
                0.0180887 = queryNorm
              0.54183614 = fieldWeight in 3411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9355025 = idf(docFreq=112, maxDocs=42740)
                0.078125 = fieldNorm(doc=3411)
          0.14547257 = weight(abstract_txt:verfahren in 3411) [ClassicSimilarity], result of:
            0.14547257 = score(doc=3411,freq=3.0), product of:
              0.18618844 = queryWeight, product of:
                1.7826564 = boost
                5.7740126 = idf(docFreq=360, maxDocs=42740)
                0.0180887 = queryNorm
              0.781319 = fieldWeight in 3411, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7740126 = idf(docFreq=360, maxDocs=42740)
                0.078125 = fieldNorm(doc=3411)
          0.11756005 = weight(abstract_txt:dokumenten in 3411) [ClassicSimilarity], result of:
            0.11756005 = score(doc=3411,freq=1.0), product of:
              0.23297659 = queryWeight, product of:
                1.9941022 = boost
                6.458884 = idf(docFreq=181, maxDocs=42740)
                0.0180887 = queryNorm
              0.5046003 = fieldWeight in 3411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.458884 = idf(docFreq=181, maxDocs=42740)
                0.078125 = fieldNorm(doc=3411)
          0.07641585 = weight(abstract_txt:werden in 3411) [ClassicSimilarity], result of:
            0.07641585 = score(doc=3411,freq=4.0), product of:
              0.13875589 = queryWeight, product of:
                2.1763663 = boost
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.0180887 = queryNorm
              0.5507215 = fieldWeight in 3411, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.078125 = fieldNorm(doc=3411)
          0.1519113 = weight(abstract_txt:informationen in 3411) [ClassicSimilarity], result of:
            0.1519113 = score(doc=3411,freq=2.0), product of:
              0.27639645 = queryWeight, product of:
                3.0716555 = boost
                4.974536 = idf(docFreq=802, maxDocs=42740)
                0.0180887 = queryNorm
              0.5496138 = fieldWeight in 3411, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.974536 = idf(docFreq=802, maxDocs=42740)
                0.078125 = fieldNorm(doc=3411)
        0.28 = coord(7/25)
    
  5. Witschel, H.F.: Terminologie-Extraktion : Möglichkeiten der Kombination statistischer und musterbasierter Verfahren (2004) 0.20
    0.20083053 = sum of:
      0.20083053 = product of:
        0.5578626 = sum of:
          0.016146852 = weight(abstract_txt:sich in 2124) [ClassicSimilarity], result of:
            0.016146852 = score(doc=2124,freq=1.0), product of:
              0.071967706 = queryWeight, product of:
                1.1083071 = boost
                3.5897994 = idf(docFreq=3206, maxDocs=42740)
                0.0180887 = queryNorm
              0.22436246 = fieldWeight in 2124, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5897994 = idf(docFreq=3206, maxDocs=42740)
                0.0625 = fieldNorm(doc=2124)
          0.019037068 = weight(abstract_txt:wird in 2124) [ClassicSimilarity], result of:
            0.019037068 = score(doc=2124,freq=1.0), product of:
              0.08031792 = queryWeight, product of:
                1.1708399 = boost
                3.7923427 = idf(docFreq=2618, maxDocs=42740)
                0.0180887 = queryNorm
              0.23702142 = fieldWeight in 2124, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7923427 = idf(docFreq=2618, maxDocs=42740)
                0.0625 = fieldNorm(doc=2124)
          0.021867719 = weight(abstract_txt:nicht in 2124) [ClassicSimilarity], result of:
            0.021867719 = score(doc=2124,freq=1.0), product of:
              0.08809435 = queryWeight, product of:
                1.2262114 = boost
                3.9716904 = idf(docFreq=2188, maxDocs=42740)
                0.0180887 = queryNorm
              0.24823065 = fieldWeight in 2124, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9716904 = idf(docFreq=2188, maxDocs=42740)
                0.0625 = fieldNorm(doc=2124)
          0.07762826 = weight(abstract_txt:automatische in 2124) [ClassicSimilarity], result of:
            0.07762826 = score(doc=2124,freq=1.0), product of:
              0.17908612 = queryWeight, product of:
                1.4275017 = boost
                6.9355025 = idf(docFreq=112, maxDocs=42740)
                0.0180887 = queryNorm
              0.4334689 = fieldWeight in 2124, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9355025 = idf(docFreq=112, maxDocs=42740)
                0.0625 = fieldNorm(doc=2124)
          0.04750235 = weight(abstract_txt:wissen in 2124) [ClassicSimilarity], result of:
            0.04750235 = score(doc=2124,freq=1.0), product of:
              0.14775963 = queryWeight, product of:
                1.5880684 = boost
                5.143743 = idf(docFreq=677, maxDocs=42740)
                0.0180887 = queryNorm
              0.32148394 = fieldWeight in 2124, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.143743 = idf(docFreq=677, maxDocs=42740)
                0.0625 = fieldNorm(doc=2124)
          0.0671909 = weight(abstract_txt:verfahren in 2124) [ClassicSimilarity], result of:
            0.0671909 = score(doc=2124,freq=1.0), product of:
              0.18618844 = queryWeight, product of:
                1.7826564 = boost
                5.7740126 = idf(docFreq=360, maxDocs=42740)
                0.0180887 = queryNorm
              0.3608758 = fieldWeight in 2124, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7740126 = idf(docFreq=360, maxDocs=42740)
                0.0625 = fieldNorm(doc=2124)
          0.154207 = weight(abstract_txt:unstrukturierten in 2124) [ClassicSimilarity], result of:
            0.154207 = score(doc=2124,freq=1.0), product of:
              0.2829989 = queryWeight, product of:
                1.7944776 = boost
                8.7184515 = idf(docFreq=18, maxDocs=42740)
                0.0180887 = queryNorm
              0.5449032 = fieldWeight in 2124, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.7184515 = idf(docFreq=18, maxDocs=42740)
                0.0625 = fieldNorm(doc=2124)
          0.06834842 = weight(abstract_txt:werden in 2124) [ClassicSimilarity], result of:
            0.06834842 = score(doc=2124,freq=5.0), product of:
              0.13875589 = queryWeight, product of:
                2.1763663 = boost
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.0180887 = queryNorm
              0.49258032 = fieldWeight in 2124, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.0625 = fieldNorm(doc=2124)
          0.085934006 = weight(abstract_txt:informationen in 2124) [ClassicSimilarity], result of:
            0.085934006 = score(doc=2124,freq=1.0), product of:
              0.27639645 = queryWeight, product of:
                3.0716555 = boost
                4.974536 = idf(docFreq=802, maxDocs=42740)
                0.0180887 = queryNorm
              0.3109085 = fieldWeight in 2124, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.974536 = idf(docFreq=802, maxDocs=42740)
                0.0625 = fieldNorm(doc=2124)
        0.36 = coord(9/25)
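
The score explanations above follow the TF-IDF scheme of Lucene's ClassicSimilarity: tf is the square root of the term frequency, idf is 1 + ln(maxDocs / (docFreq + 1)), and each term contributes queryWeight × fieldWeight before the sum is scaled by the coord factor. The following Python sketch recomputes the score of the first similar document (doc 1669) from the values printed in its explanation. The function and variable names are illustrative and not part of Lucene; boost, queryNorm, and fieldNorm are copied from the listing rather than derived, and small rounding differences are to be expected because Lucene works with 32-bit floats.

```python
import math

# Constants copied from the explanation output above.
MAX_DOCS = 42740
QUERY_NORM = 0.0180887


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm, query_norm=QUERY_NORM):
    """Score of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf = sqrt(freq)
    return query_weight * field_weight


# (freq, docFreq, boost) for the eight matching terms of doc 1669.
TERMS_1669 = [
    (1.0, 893, 1.0017896),   # mehr
    (1.0, 2188, 1.2262114),  # nicht
    (1.0, 212, 1.2970282),   # führt
    (1.0, 122, 1.4100484),   # texte
    (1.0, 18, 1.7944776),    # unstrukturierten
    (1.0, 181, 1.9941022),   # dokumenten
    (1.0, 5, 2.0317283),     # unstrukturierter
    (3.0, 3422, 2.1763663),  # werden
]
FIELD_NORM_1669 = 0.078125

# Sum of the per-term scores, then scaled by coord (8 of 25 query terms matched).
raw_sum = sum(term_score(f, df, b, FIELD_NORM_1669) for f, df, b in TERMS_1669)
coord = 8 / 25
total = raw_sum * coord

print(f"sum of term scores: {raw_sum:.6f}")
print(f"coord factor:       {coord:.2f}")
print(f"final score:        {total:.6f}")
```

The coord factor explains why document 2 (Kaiser), which matches 12 of the 25 query terms, reaches nearly the same final score as document 1 despite a much lower raw sum: its raw sum is multiplied by 0.48 instead of 0.32.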