Document (#31987)

Author
Halip, I.
Title
Automatische Extrahierung von Schlagworten aus unstrukturierten Texten
Imprint
Münster : Institut für Wirtschaftsinformatik der Westfälische Wilhelms-Universität Münster
Year
2005
Pages
24 S
Abstract
Durch die zunehmende Mediatisierung und Digitalisierung wird die moderne Gesellschaft immer mehr mit dem Thema der Informationsüberflutung konfrontiert. Erstaunlicherweise führt der Zuwachs an Informationen gleichzeitig zu einem Mangel an Wissen. Die Erklärung kann darin gefunden werden, dass ein großer Teil der existierenden Informationen nicht aufgefunden werden kann. Es handelt sich meistens um Informationen die auf semi- und nichtstrukturierte Daten beruhen. Schätzungen zufolge sind heute rund 80% der entscheidungsrelevanten Informationen in Unternehmen in unstrukturierter, d. h. meist textueller Form vorhanden. Die Unfähigkeit der Maschinen den Inhalt unstrukturierter Texte zu verstehen führt dazu, dass dokumentiertes Wissen schwer auffindbar ist und oft unentdeckt bleibt. Wegen des Informationsvolumens, das meistens zu groß ist, um gelesen, verstanden oder sogar benutzt zu werden, ergibt sich folgendes Problem, mit dem man konfrontiert wird: Informationen die nicht in Wissen umgewandelt werden können, bleiben als papiergebundene oder digitale Dokumente in Data-Repositories verschlossen. Angesichts der heute anfallenden Menge an Dokumenten erscheint eine manuelle Vergabe von Schlagworten nicht mehr realistisch. Deshalb entwickelt Wissensmanagement unterstützende Verfahren, die Informationen rechtzeitig, in der richtigen Qualität und den richtigen Personen verfügbar machen. Einige Schwerpunkte an denen zur Zeit geforscht wird, sind Modelle zur Repräsentation von Dokumenten, Methoden zur Ähnlichkeitsbestimmung von Anfragen zu Dokumenten und zur Indexierung von Dokumentenmengen, sowie die automatische Klassifikation. Vor diesem Hintergrund konzentriert sich diese Arbeit auf die unterschiedlichen Verfahren der automatischen Indexierung, hebt die algorithmischen Vor- und Nachteile hervor, mit dem Ziel die Funktionsweise im Bereich der unstrukturierten Texte zu analysieren. Hierfür erfolgt im 3. Kapitel eine genauere Untersuchung und Darstellung automatischer Indexierungsverfahren. Zuvor werden in Kapitel 2 grundlegende Begrifflichkeiten erklärt, eingeordnet und abgegrenzt. Abschließend werden anhand der theoretischen Darlegung Implementierungen der vorgestellten Verfahren kurz beschrieben. Die Ausarbeitung endet mit der Schlussfolgerung und dem Ausblick.
Content
Ausarbeitung im Rahmen des Seminars Suchmaschinen und Suchalgorithmen, Institut für Wirtschaftsinformatik Praktische Informatik in der Wirtschaft, Westfälische Wilhelms-Universität Münster
Theme
Automatisches Indexieren
Object
IDX

Similar documents (content)

  1. Loonus, Y.: Einsatzbereiche der KI und ihre Relevanz für Information Professionals (2017) 0.28
    0.28058216 = sum of:
      0.28058216 = product of:
        0.87681925 = sum of:
          0.033193942 = weight(abstract_txt:mehr in 5668) [ClassicSimilarity], result of:
            0.033193942 = score(doc=5668,freq=1.0), product of:
              0.08764402 = queryWeight, product of:
                4.8478208 = idf(docFreq=942, maxDocs=44218)
                0.018079055 = queryNorm
              0.378736 = fieldWeight in 5668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8478208 = idf(docFreq=942, maxDocs=44218)
                0.078125 = fieldNorm(doc=5668)
          0.026813999 = weight(abstract_txt:nicht in 5668) [ClassicSimilarity], result of:
            0.026813999 = score(doc=5668,freq=1.0), product of:
              0.08702042 = queryWeight, product of:
                1.22038 = boost
                3.9441223 = idf(docFreq=2327, maxDocs=44218)
                0.018079055 = queryNorm
              0.30813456 = fieldWeight in 5668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9441223 = idf(docFreq=2327, maxDocs=44218)
                0.078125 = fieldNorm(doc=5668)
          0.072034605 = weight(abstract_txt:führt in 5668) [ClassicSimilarity], result of:
            0.072034605 = score(doc=5668,freq=1.0), product of:
              0.14690745 = queryWeight, product of:
                1.294675 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.018079055 = queryNorm
              0.49034002 = fieldWeight in 5668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.078125 = fieldNorm(doc=5668)
          0.090680465 = weight(abstract_txt:texte in 5668) [ClassicSimilarity], result of:
            0.090680465 = score(doc=5668,freq=1.0), product of:
              0.1712743 = queryWeight, product of:
                1.3979285 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.018079055 = queryNorm
              0.5294458 = fieldWeight in 5668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.078125 = fieldNorm(doc=5668)
          0.18872252 = weight(abstract_txt:unstrukturierten in 5668) [ClassicSimilarity], result of:
            0.18872252 = score(doc=5668,freq=1.0), product of:
              0.27918935 = queryWeight, product of:
                1.7847947 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.018079055 = queryNorm
              0.675966 = fieldWeight in 5668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.078125 = fieldNorm(doc=5668)
          0.11697718 = weight(abstract_txt:dokumenten in 5668) [ClassicSimilarity], result of:
            0.11697718 = score(doc=5668,freq=1.0), product of:
              0.23233478 = queryWeight, product of:
                1.9940755 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.018079055 = queryNorm
              0.50348544 = fieldWeight in 5668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.078125 = fieldNorm(doc=5668)
          0.28313884 = weight(abstract_txt:unstrukturierter in 5668) [ClassicSimilarity], result of:
            0.28313884 = score(doc=5668,freq=1.0), product of:
              0.36588898 = queryWeight, product of:
                2.0432124 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.018079055 = queryNorm
              0.7738381 = fieldWeight in 5668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.078125 = fieldNorm(doc=5668)
          0.0652577 = weight(abstract_txt:werden in 5668) [ClassicSimilarity], result of:
            0.0652577 = score(doc=5668,freq=3.0), product of:
              0.1375427 = queryWeight, product of:
                2.1697927 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.018079055 = queryNorm
              0.47445413 = fieldWeight in 5668, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.078125 = fieldNorm(doc=5668)
        0.32 = coord(8/25)
    
  2. Kaiser, A.: Computer-unterstütztes Indexieren in Intelligenten Information Retrieval Systemen : Ein Relevanz-Feedback orientierter Ansatz zur Informationserschließung in unformatierten Datenbanken (1993) 0.27
    0.26536232 = sum of:
      0.26536232 = product of:
        0.5528382 = sum of:
          0.015755901 = weight(abstract_txt:sich in 4284) [ClassicSimilarity], result of:
            0.015755901 = score(doc=4284,freq=4.0), product of:
              0.07084061 = queryWeight, product of:
                1.1010971 = boost
                3.5586145 = idf(docFreq=3422, maxDocs=44218)
                0.018079055 = queryNorm
              0.2224134 = fieldWeight in 4284, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5586145 = idf(docFreq=3422, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.018519677 = weight(abstract_txt:heute in 4284) [ClassicSimilarity], result of:
            0.018519677 = score(doc=4284,freq=1.0), product of:
              0.10941209 = queryWeight, product of:
                1.1173044 = boost
                5.4164915 = idf(docFreq=533, maxDocs=44218)
                0.018079055 = queryNorm
              0.16926536 = fieldWeight in 4284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4164915 = idf(docFreq=533, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.020997955 = weight(abstract_txt:wird in 4284) [ClassicSimilarity], result of:
            0.020997955 = score(doc=4284,freq=5.0), product of:
              0.07964065 = queryWeight, product of:
                1.1674865 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.018079055 = queryNorm
              0.26365876 = fieldWeight in 4284, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.0214512 = weight(abstract_txt:nicht in 4284) [ClassicSimilarity], result of:
            0.0214512 = score(doc=4284,freq=4.0), product of:
              0.08702042 = queryWeight, product of:
                1.22038 = boost
                3.9441223 = idf(docFreq=2327, maxDocs=44218)
                0.018079055 = queryNorm
              0.24650764 = fieldWeight in 4284, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9441223 = idf(docFreq=2327, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.08799907 = weight(abstract_txt:indexierung in 4284) [ClassicSimilarity], result of:
            0.08799907 = score(doc=4284,freq=6.0), product of:
              0.1701811 = queryWeight, product of:
                1.39346 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.018079055 = queryNorm
              0.51709074 = fieldWeight in 4284, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.038581256 = weight(abstract_txt:automatische in 4284) [ClassicSimilarity], result of:
            0.038581256 = score(doc=4284,freq=1.0), product of:
              0.17846811 = queryWeight, product of:
                1.4269842 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.018079055 = queryNorm
              0.21618012 = fieldWeight in 4284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.03338044 = weight(abstract_txt:wissen in 4284) [ClassicSimilarity], result of:
            0.03338044 = score(doc=4284,freq=2.0), product of:
              0.14722838 = queryWeight, product of:
                1.5873775 = boost
                5.1302147 = idf(docFreq=710, maxDocs=44218)
                0.018079055 = queryNorm
              0.2267256 = fieldWeight in 4284, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1302147 = idf(docFreq=710, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.055852994 = weight(abstract_txt:konfrontiert in 4284) [ClassicSimilarity], result of:
            0.055852994 = score(doc=4284,freq=1.0), product of:
              0.22838838 = queryWeight, product of:
                1.6142688 = boost
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.018079055 = queryNorm
              0.24455269 = fieldWeight in 4284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.033441722 = weight(abstract_txt:verfahren in 4284) [ClassicSimilarity], result of:
            0.033441722 = score(doc=4284,freq=1.0), product of:
              0.18572308 = queryWeight, product of:
                1.7828608 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.018079055 = queryNorm
              0.18006228 = fieldWeight in 4284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.09358174 = weight(abstract_txt:dokumenten in 4284) [ClassicSimilarity], result of:
            0.09358174 = score(doc=4284,freq=4.0), product of:
              0.23233478 = queryWeight, product of:
                1.9940755 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.018079055 = queryNorm
              0.40278837 = fieldWeight in 4284, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.03691533 = weight(abstract_txt:werden in 4284) [ClassicSimilarity], result of:
            0.03691533 = score(doc=4284,freq=6.0), product of:
              0.1375427 = queryWeight, product of:
                2.1697927 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.018079055 = queryNorm
              0.2683918 = fieldWeight in 4284, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.09636097 = weight(abstract_txt:informationen in 4284) [ClassicSimilarity], result of:
            0.09636097 = score(doc=4284,freq=5.0), product of:
              0.27709427 = queryWeight, product of:
                3.0797343 = boost
                4.976667 = idf(docFreq=828, maxDocs=44218)
                0.018079055 = queryNorm
              0.34775516 = fieldWeight in 4284, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.976667 = idf(docFreq=828, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
        0.48 = coord(12/25)
    
  3. Nohr, H.: Automatische Indexierung : Einführung in betriebliche Verfahren, Systeme und Anwendungen (2001) 0.26
    0.25621262 = sum of:
      0.25621262 = product of:
        0.8006644 = sum of:
          0.026555156 = weight(abstract_txt:mehr in 2543) [ClassicSimilarity], result of:
            0.026555156 = score(doc=2543,freq=1.0), product of:
              0.08764402 = queryWeight, product of:
                4.8478208 = idf(docFreq=942, maxDocs=44218)
                0.018079055 = queryNorm
              0.3029888 = fieldWeight in 2543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8478208 = idf(docFreq=942, maxDocs=44218)
                0.0625 = fieldNorm(doc=2543)
          0.015755901 = weight(abstract_txt:sich in 2543) [ClassicSimilarity], result of:
            0.015755901 = score(doc=2543,freq=1.0), product of:
              0.07084061 = queryWeight, product of:
                1.1010971 = boost
                3.5586145 = idf(docFreq=3422, maxDocs=44218)
                0.018079055 = queryNorm
              0.2224134 = fieldWeight in 2543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5586145 = idf(docFreq=3422, maxDocs=44218)
                0.0625 = fieldNorm(doc=2543)
          0.14370187 = weight(abstract_txt:indexierung in 2543) [ClassicSimilarity], result of:
            0.14370187 = score(doc=2543,freq=4.0), product of:
              0.1701811 = queryWeight, product of:
                1.39346 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.018079055 = queryNorm
              0.8444056 = fieldWeight in 2543, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.0625 = fieldNorm(doc=2543)
          0.14955592 = weight(abstract_txt:verfahren in 2543) [ClassicSimilarity], result of:
            0.14955592 = score(doc=2543,freq=5.0), product of:
              0.18572308 = queryWeight, product of:
                1.7828608 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.018079055 = queryNorm
              0.805263 = fieldWeight in 2543, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.0625 = fieldNorm(doc=2543)
          0.150978 = weight(abstract_txt:unstrukturierten in 2543) [ClassicSimilarity], result of:
            0.150978 = score(doc=2543,freq=1.0), product of:
              0.27918935 = queryWeight, product of:
                1.7847947 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.018079055 = queryNorm
              0.5407728 = fieldWeight in 2543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.0625 = fieldNorm(doc=2543)
          0.16208833 = weight(abstract_txt:dokumenten in 2543) [ClassicSimilarity], result of:
            0.16208833 = score(doc=2543,freq=3.0), product of:
              0.23233478 = queryWeight, product of:
                1.9940755 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.018079055 = queryNorm
              0.6976499 = fieldWeight in 2543, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.0625 = fieldNorm(doc=2543)
          0.03014124 = weight(abstract_txt:werden in 2543) [ClassicSimilarity], result of:
            0.03014124 = score(doc=2543,freq=1.0), product of:
              0.1375427 = queryWeight, product of:
                2.1697927 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.018079055 = queryNorm
              0.21914098 = fieldWeight in 2543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0625 = fieldNorm(doc=2543)
          0.12188805 = weight(abstract_txt:informationen in 2543) [ClassicSimilarity], result of:
            0.12188805 = score(doc=2543,freq=2.0), product of:
              0.27709427 = queryWeight, product of:
                3.0797343 = boost
                4.976667 = idf(docFreq=828, maxDocs=44218)
                0.018079055 = queryNorm
              0.43987936 = fieldWeight in 2543, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.976667 = idf(docFreq=828, maxDocs=44218)
                0.0625 = fieldNorm(doc=2543)
        0.32 = coord(8/25)
    
  4. Helmbrecht-Schaar, A.: Entwicklung eines Verfahrens der automatischen Klassifizierung für Textdokumente aus dem Fachbereich Informatik mithilfe eines fachspezifischen Klassifikationssystems (2007) 0.20
    0.20260346 = sum of:
      0.20260346 = product of:
        0.72358376 = sum of:
          0.046952855 = weight(abstract_txt:wird in 1410) [ClassicSimilarity], result of:
            0.046952855 = score(doc=1410,freq=4.0), product of:
              0.07964065 = queryWeight, product of:
                1.1674865 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.018079055 = queryNorm
              0.5895589 = fieldWeight in 1410, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.078125 = fieldNorm(doc=1410)
          0.090680465 = weight(abstract_txt:texte in 1410) [ClassicSimilarity], result of:
            0.090680465 = score(doc=1410,freq=1.0), product of:
              0.1712743 = queryWeight, product of:
                1.3979285 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.018079055 = queryNorm
              0.5294458 = fieldWeight in 1410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.078125 = fieldNorm(doc=1410)
          0.09645314 = weight(abstract_txt:automatische in 1410) [ClassicSimilarity], result of:
            0.09645314 = score(doc=1410,freq=1.0), product of:
              0.17846811 = queryWeight, product of:
                1.4269842 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.018079055 = queryNorm
              0.5404503 = fieldWeight in 1410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.078125 = fieldNorm(doc=1410)
          0.1448069 = weight(abstract_txt:verfahren in 1410) [ClassicSimilarity], result of:
            0.1448069 = score(doc=1410,freq=3.0), product of:
              0.18572308 = queryWeight, product of:
                1.7828608 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.018079055 = queryNorm
              0.77969253 = fieldWeight in 1410, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.078125 = fieldNorm(doc=1410)
          0.11697718 = weight(abstract_txt:dokumenten in 1410) [ClassicSimilarity], result of:
            0.11697718 = score(doc=1410,freq=1.0), product of:
              0.23233478 = queryWeight, product of:
                1.9940755 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.018079055 = queryNorm
              0.50348544 = fieldWeight in 1410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.078125 = fieldNorm(doc=1410)
          0.0753531 = weight(abstract_txt:werden in 1410) [ClassicSimilarity], result of:
            0.0753531 = score(doc=1410,freq=4.0), product of:
              0.1375427 = queryWeight, product of:
                2.1697927 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.018079055 = queryNorm
              0.54785246 = fieldWeight in 1410, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.078125 = fieldNorm(doc=1410)
          0.15236007 = weight(abstract_txt:informationen in 1410) [ClassicSimilarity], result of:
            0.15236007 = score(doc=1410,freq=2.0), product of:
              0.27709427 = queryWeight, product of:
                3.0797343 = boost
                4.976667 = idf(docFreq=828, maxDocs=44218)
                0.018079055 = queryNorm
              0.5498492 = fieldWeight in 1410, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.976667 = idf(docFreq=828, maxDocs=44218)
                0.078125 = fieldNorm(doc=1410)
        0.28 = coord(7/25)
    
  5. Witschel, H.F.: Terminologie-Extraktion : Möglichkeiten der Kombination statistischer uns musterbasierter Verfahren (2004) 0.20
    0.19864981 = sum of:
      0.19864981 = product of:
        0.551805 = sum of:
          0.015755901 = weight(abstract_txt:sich in 123) [ClassicSimilarity], result of:
            0.015755901 = score(doc=123,freq=1.0), product of:
              0.07084061 = queryWeight, product of:
                1.1010971 = boost
                3.5586145 = idf(docFreq=3422, maxDocs=44218)
                0.018079055 = queryNorm
              0.2224134 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5586145 = idf(docFreq=3422, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.01878114 = weight(abstract_txt:wird in 123) [ClassicSimilarity], result of:
            0.01878114 = score(doc=123,freq=1.0), product of:
              0.07964065 = queryWeight, product of:
                1.1674865 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.018079055 = queryNorm
              0.23582356 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.0214512 = weight(abstract_txt:nicht in 123) [ClassicSimilarity], result of:
            0.0214512 = score(doc=123,freq=1.0), product of:
              0.08702042 = queryWeight, product of:
                1.22038 = boost
                3.9441223 = idf(docFreq=2327, maxDocs=44218)
                0.018079055 = queryNorm
              0.24650764 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9441223 = idf(docFreq=2327, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.07716251 = weight(abstract_txt:automatische in 123) [ClassicSimilarity], result of:
            0.07716251 = score(doc=123,freq=1.0), product of:
              0.17846811 = queryWeight, product of:
                1.4269842 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.018079055 = queryNorm
              0.43236023 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.047207072 = weight(abstract_txt:wissen in 123) [ClassicSimilarity], result of:
            0.047207072 = score(doc=123,freq=1.0), product of:
              0.14722838 = queryWeight, product of:
                1.5873775 = boost
                5.1302147 = idf(docFreq=710, maxDocs=44218)
                0.018079055 = queryNorm
              0.32063842 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1302147 = idf(docFreq=710, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.066883445 = weight(abstract_txt:verfahren in 123) [ClassicSimilarity], result of:
            0.066883445 = score(doc=123,freq=1.0), product of:
              0.18572308 = queryWeight, product of:
                1.7828608 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.018079055 = queryNorm
              0.36012456 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.150978 = weight(abstract_txt:unstrukturierten in 123) [ClassicSimilarity], result of:
            0.150978 = score(doc=123,freq=1.0), product of:
              0.27918935 = queryWeight, product of:
                1.7847947 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.018079055 = queryNorm
              0.5407728 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.06739786 = weight(abstract_txt:werden in 123) [ClassicSimilarity], result of:
            0.06739786 = score(doc=123,freq=5.0), product of:
              0.1375427 = queryWeight, product of:
                2.1697927 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.018079055 = queryNorm
              0.49001414 = fieldWeight in 123, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.08618787 = weight(abstract_txt:informationen in 123) [ClassicSimilarity], result of:
            0.08618787 = score(doc=123,freq=1.0), product of:
              0.27709427 = queryWeight, product of:
                3.0797343 = boost
                4.976667 = idf(docFreq=828, maxDocs=44218)
                0.018079055 = queryNorm
              0.31104168 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.976667 = idf(docFreq=828, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
        0.36 = coord(9/25)