Document (#37127)

Author
Witschel, H.F.
Title
Text, Wörter, Morpheme : Möglichkeiten einer automatischen Terminologie-Extraktion
Imprint
Leipzig : Universität / Fakultät für Mathematik und Informatik Institut für Informatik
Year
2004
Pages
141 S
Abstract
Die vorliegende Arbeit beschäftigt sich mit einem Teilgebiet des TextMining, versucht also Information (in diesem Fall Fachterminologie) aus natürlichsprachlichem Text zu extrahieren. Die der Arbeit zugrundeliegende These besagt, daß in vielen Gebieten des Text Mining die Kombination verschiedener Methoden sinnvoll sein kann, um dem Facettenreichtum natürlicher Sprache gerecht zu werden. Die bei der Terminologie-Extraktion angewandten Methoden sind statistischer und linguistischer (bzw. musterbasierter) Natur. Um sie herzuleiten, wurden einige Eigenschaften von Fachtermini herausgearbeitet, die für deren Extraktion relevant sind. So läßt sich z.B. die Tatsache, daß viele Fachbegriffe Nominalphrasen einer bestimmten Form sind, direkt für eine Suche nach gewissen POS-Mustern ausnützen, die Verteilung von Termen in Fachtexten führte zu einem statistischen Ansatz - der Differenzanalyse. Zusammen mit einigen weiteren wurden diese Ansätze in ein Verfahren integriert, welches in der Lage ist, aus dem Feedback eines Anwenders zu lernen und in mehreren Schritten die Suche nach Terminologie zu verfeinern. Dabei wurden mehrere Parameter des Verfahrens veränderlich belassen, d.h. der Anwender kann sie beliebig anpassen. Bei der Untersuchung der Ergebnisse anhand von zwei Fachtexten aus unterschiedlichen Domänen wurde deutlich, daß sich zwar die verschiedenen Verfahren gut ergänzen, daß aber die optimalen Werte der veränderbaren Parameter, ja selbst die Auswahl der angewendeten Verfahren text- und domänenabhängig sind.
Theme
Computerlinguistik
Data Mining

Similar documents (author)

  1. Witschel, H.F.: Global term weights in distributed environments (2008) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:witschel in 2096) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 2096, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=2096)
    
  2. Witschel, H.F.: Terminologie-Extraktion : Möglichkeiten der Kombination statistischer uns musterbasierter Verfahren (2004) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:witschel in 123) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 123, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=123)
    
  3. Witschel, H.F.: Global and local resources for peer-to-peer text retrieval (2008) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:witschel in 127) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 127, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=127)
    
  4. Witschel, H.F.: Terminology extraction and automatic indexing : comparison and qualitative evaluation of methods (2005) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:witschel in 1842) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 1842, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=1842)
    

Similar documents (content)

  1. Witschel, H.F.: Terminologie-Extraktion : Möglichkeiten der Kombination statistischer uns musterbasierter Verfahren (2004) 1.07
    1.06778 = sum of:
      1.06778 = product of:
        1.7796333 = sum of:
          0.08579144 = weight(abstract_txt:extrahieren in 123) [ClassicSimilarity], result of:
            0.08579144 = score(doc=123,freq=1.0), product of:
              0.15486388 = queryWeight, product of:
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.017471747 = queryNorm
              0.55397964 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.022545883 = weight(abstract_txt:kann in 123) [ClassicSimilarity], result of:
            0.022545883 = score(doc=123,freq=1.0), product of:
              0.080052644 = queryWeight, product of:
                1.0167818 = boost
                4.5062113 = idf(docFreq=1326, maxDocs=44218)
                0.017471747 = queryNorm
              0.2816382 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5062113 = idf(docFreq=1326, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.09381915 = weight(abstract_txt:fachbegriffe in 123) [ClassicSimilarity], result of:
            0.09381915 = score(doc=123,freq=1.0), product of:
              0.16437982 = queryWeight, product of:
                1.0302656 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.017471747 = queryNorm
              0.5707461 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.09630783 = weight(abstract_txt:teilgebiet in 123) [ClassicSimilarity], result of:
            0.09630783 = score(doc=123,freq=1.0), product of:
              0.16727404 = queryWeight, product of:
                1.0392959 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.017471747 = queryNorm
              0.5757488 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.20736879 = weight(abstract_txt:fachtermini in 123) [ClassicSimilarity], result of:
            0.20736879 = score(doc=123,freq=3.0), product of:
              0.19339387 = queryWeight, product of:
                1.1174968 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.017471747 = queryNorm
              1.0722615 = fieldWeight in 123, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.036509685 = weight(abstract_txt:arbeit in 123) [ClassicSimilarity], result of:
            0.036509685 = score(doc=123,freq=1.0), product of:
              0.110391654 = queryWeight, product of:
                1.1940103 = boost
                5.291659 = idf(docFreq=604, maxDocs=44218)
                0.017471747 = queryNorm
              0.33072868 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.291659 = idf(docFreq=604, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.016655844 = weight(abstract_txt:sich in 123) [ClassicSimilarity], result of:
            0.016655844 = score(doc=123,freq=1.0), product of:
              0.07488687 = queryWeight, product of:
                1.2044489 = boost
                3.5586145 = idf(docFreq=3422, maxDocs=44218)
                0.017471747 = queryNorm
              0.2224134 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5586145 = idf(docFreq=3422, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.043505818 = weight(abstract_txt:suche in 123) [ClassicSimilarity], result of:
            0.043505818 = score(doc=123,freq=1.0), product of:
              0.12407827 = queryWeight, product of:
                1.2658662 = boost
                5.6101127 = idf(docFreq=439, maxDocs=44218)
                0.017471747 = queryNorm
              0.35063204 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6101127 = idf(docFreq=439, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.04766271 = weight(abstract_txt:methoden in 123) [ClassicSimilarity], result of:
            0.04766271 = score(doc=123,freq=1.0), product of:
              0.13186109 = queryWeight, product of:
                1.3049632 = boost
                5.7833843 = idf(docFreq=369, maxDocs=44218)
                0.017471747 = queryNorm
              0.36146152 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7833843 = idf(docFreq=369, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.041896526 = weight(abstract_txt:sind in 123) [ClassicSimilarity], result of:
            0.041896526 = score(doc=123,freq=2.0), product of:
              0.1209993 = queryWeight, product of:
                1.7678539 = boost
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.017471747 = queryNorm
              0.3462543 = fieldWeight in 123, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.046085894 = weight(abstract_txt:text in 123) [ClassicSimilarity], result of:
            0.046085894 = score(doc=123,freq=2.0), product of:
              0.12893659 = queryWeight, product of:
                1.8249166 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017471747 = queryNorm
              0.3574307 = fieldWeight in 123, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.07070368 = weight(abstract_txt:verfahren in 123) [ClassicSimilarity], result of:
            0.07070368 = score(doc=123,freq=1.0), product of:
              0.19633117 = queryWeight, product of:
                1.9502046 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.017471747 = queryNorm
              0.36012456 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.22844258 = weight(abstract_txt:fachtexten in 123) [ClassicSimilarity], result of:
            0.22844258 = score(doc=123,freq=1.0), product of:
              0.37484255 = queryWeight, product of:
                2.2002113 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.017471747 = queryNorm
              0.6094361 = fieldWeight in 123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.32062614 = weight(abstract_txt:terminologie in 123) [ClassicSimilarity], result of:
            0.32062614 = score(doc=123,freq=4.0), product of:
              0.33884978 = queryWeight, product of:
                2.5620594 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.017471747 = queryNorm
              0.9462191 = fieldWeight in 123, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
          0.4217112 = weight(abstract_txt:extraktion in 123) [ClassicSimilarity], result of:
            0.4217112 = score(doc=123,freq=3.0), product of:
              0.44771084 = queryWeight, product of:
                2.9449937 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.017471747 = queryNorm
              0.9419276 = fieldWeight in 123, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.0625 = fieldNorm(doc=123)
        0.6 = coord(15/25)
    
  2. Bredack, J.: Terminologieextraktion von Mehrwortgruppen in kunsthistorischen Fachtexten (2013) 0.40
    0.40334183 = sum of:
      0.40334183 = product of:
        1.0083545 = sum of:
          0.11989718 = weight(abstract_txt:extrahieren in 1054) [ClassicSimilarity], result of:
            0.11989718 = score(doc=1054,freq=5.0), product of:
              0.15486388 = queryWeight, product of:
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.017471747 = queryNorm
              0.7742101 = fieldWeight in 1054, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
          0.060192395 = weight(abstract_txt:fachterminologie in 1054) [ClassicSimilarity], result of:
            0.060192395 = score(doc=1054,freq=1.0), product of:
              0.16727404 = queryWeight, product of:
                1.0392959 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.017471747 = queryNorm
              0.35984302 = fieldWeight in 1054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
          0.03952289 = weight(abstract_txt:arbeit in 1054) [ClassicSimilarity], result of:
            0.03952289 = score(doc=1054,freq=3.0), product of:
              0.110391654 = queryWeight, product of:
                1.1940103 = boost
                5.291659 = idf(docFreq=604, maxDocs=44218)
                0.017471747 = queryNorm
              0.35802427 = fieldWeight in 1054, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.291659 = idf(docFreq=604, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
          0.014721826 = weight(abstract_txt:sich in 1054) [ClassicSimilarity], result of:
            0.014721826 = score(doc=1054,freq=2.0), product of:
              0.07488687 = queryWeight, product of:
                1.2044489 = boost
                3.5586145 = idf(docFreq=3422, maxDocs=44218)
                0.017471747 = queryNorm
              0.19658753 = fieldWeight in 1054, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5586145 = idf(docFreq=3422, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
          0.056260377 = weight(abstract_txt:wurden in 1054) [ClassicSimilarity], result of:
            0.056260377 = score(doc=1054,freq=3.0), product of:
              0.15990765 = queryWeight, product of:
                1.7600305 = boost
                5.2001123 = idf(docFreq=662, maxDocs=44218)
                0.017471747 = queryNorm
              0.35183042 = fieldWeight in 1054, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2001123 = idf(docFreq=662, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
          0.02618533 = weight(abstract_txt:sind in 1054) [ClassicSimilarity], result of:
            0.02618533 = score(doc=1054,freq=2.0), product of:
              0.1209993 = queryWeight, product of:
                1.7678539 = boost
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.017471747 = queryNorm
              0.21640894 = fieldWeight in 1054, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
          0.02036728 = weight(abstract_txt:text in 1054) [ClassicSimilarity], result of:
            0.02036728 = score(doc=1054,freq=1.0), product of:
              0.12893659 = queryWeight, product of:
                1.8249166 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017471747 = queryNorm
              0.15796354 = fieldWeight in 1054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
          0.0883796 = weight(abstract_txt:verfahren in 1054) [ClassicSimilarity], result of:
            0.0883796 = score(doc=1054,freq=4.0), product of:
              0.19633117 = queryWeight, product of:
                1.9502046 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.017471747 = queryNorm
              0.4501557 = fieldWeight in 1054, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
          0.3192582 = weight(abstract_txt:fachtexten in 1054) [ClassicSimilarity], result of:
            0.3192582 = score(doc=1054,freq=5.0), product of:
              0.37484255 = queryWeight, product of:
                2.2002113 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.017471747 = queryNorm
              0.8517128 = fieldWeight in 1054, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
          0.2635695 = weight(abstract_txt:extraktion in 1054) [ClassicSimilarity], result of:
            0.2635695 = score(doc=1054,freq=3.0), product of:
              0.44771084 = queryWeight, product of:
                2.9449937 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.017471747 = queryNorm
              0.58870476 = fieldWeight in 1054, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
        0.4 = coord(10/25)
    
  3. Terminologie : Epochen - Schwerpunkte - Umsetzungen : zum 25-jährigen Bestehen des Rats für Deutschsprachige Terminologie (2019) 0.18
    0.18127596 = sum of:
      0.18127596 = product of:
        0.9063798 = sum of:
          0.04079832 = weight(abstract_txt:sich in 5602) [ClassicSimilarity], result of:
            0.04079832 = score(doc=5602,freq=6.0), product of:
              0.07488687 = queryWeight, product of:
                1.2044489 = boost
                3.5586145 = idf(docFreq=3422, maxDocs=44218)
                0.017471747 = queryNorm
              0.5447994 = fieldWeight in 5602, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.5586145 = idf(docFreq=3422, maxDocs=44218)
                0.0625 = fieldNorm(doc=5602)
          0.04766271 = weight(abstract_txt:methoden in 5602) [ClassicSimilarity], result of:
            0.04766271 = score(doc=5602,freq=1.0), product of:
              0.13186109 = queryWeight, product of:
                1.3049632 = boost
                5.7833843 = idf(docFreq=369, maxDocs=44218)
                0.017471747 = queryNorm
              0.36146152 = fieldWeight in 5602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7833843 = idf(docFreq=369, maxDocs=44218)
                0.0625 = fieldNorm(doc=5602)
          0.07070368 = weight(abstract_txt:verfahren in 5602) [ClassicSimilarity], result of:
            0.07070368 = score(doc=5602,freq=1.0), product of:
              0.19633117 = queryWeight, product of:
                1.9502046 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.017471747 = queryNorm
              0.36012456 = fieldWeight in 5602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.0625 = fieldNorm(doc=5602)
          0.3230666 = weight(abstract_txt:fachtexten in 5602) [ClassicSimilarity], result of:
            0.3230666 = score(doc=5602,freq=2.0), product of:
              0.37484255 = queryWeight, product of:
                2.2002113 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.017471747 = queryNorm
              0.8618728 = fieldWeight in 5602, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=5602)
          0.4241485 = weight(abstract_txt:terminologie in 5602) [ClassicSimilarity], result of:
            0.4241485 = score(doc=5602,freq=7.0), product of:
              0.33884978 = queryWeight, product of:
                2.5620594 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.017471747 = queryNorm
              1.2517302 = fieldWeight in 5602, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.0625 = fieldNorm(doc=5602)
        0.2 = coord(5/25)
    
  4. Ingold, M.: ¬Das bibliothekarische Konzept der Informationskompetenz : ein Überblick (2005) 0.13
    0.1341959 = sum of:
      0.1341959 = product of:
        0.67097944 = sum of:
          0.19261566 = weight(abstract_txt:teilgebiet in 1413) [ClassicSimilarity], result of:
            0.19261566 = score(doc=1413,freq=1.0), product of:
              0.16727404 = queryWeight, product of:
                1.0392959 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.017471747 = queryNorm
              1.1514976 = fieldWeight in 1413, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.125 = fieldNorm(doc=1413)
          0.033311687 = weight(abstract_txt:sich in 1413) [ClassicSimilarity], result of:
            0.033311687 = score(doc=1413,freq=1.0), product of:
              0.07488687 = queryWeight, product of:
                1.2044489 = boost
                3.5586145 = idf(docFreq=3422, maxDocs=44218)
                0.017471747 = queryNorm
              0.4448268 = fieldWeight in 1413, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5586145 = idf(docFreq=3422, maxDocs=44218)
                0.125 = fieldNorm(doc=1413)
          0.059250638 = weight(abstract_txt:sind in 1413) [ClassicSimilarity], result of:
            0.059250638 = score(doc=1413,freq=1.0), product of:
              0.1209993 = queryWeight, product of:
                1.7678539 = boost
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.017471747 = queryNorm
              0.48967752 = fieldWeight in 1413, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.125 = fieldNorm(doc=1413)
          0.065175295 = weight(abstract_txt:text in 1413) [ClassicSimilarity], result of:
            0.065175295 = score(doc=1413,freq=1.0), product of:
              0.12893659 = queryWeight, product of:
                1.8249166 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017471747 = queryNorm
              0.5054833 = fieldWeight in 1413, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.125 = fieldNorm(doc=1413)
          0.32062614 = weight(abstract_txt:terminologie in 1413) [ClassicSimilarity], result of:
            0.32062614 = score(doc=1413,freq=1.0), product of:
              0.33884978 = queryWeight, product of:
                2.5620594 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.017471747 = queryNorm
              0.9462191 = fieldWeight in 1413, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.125 = fieldNorm(doc=1413)
        0.2 = coord(5/25)
    
  5. Strötgen, R.; Kokkelink, S.: Metadatenextraktion aus Internetquellen : Heterogenitätsbehandlung im Projekt CARMEN (2001) 0.13
    0.13349144 = sum of:
      0.13349144 = product of:
        0.55621433 = sum of:
          0.09906271 = weight(abstract_txt:termen in 5808) [ClassicSimilarity], result of:
            0.09906271 = score(doc=5808,freq=1.0), product of:
              0.17044894 = queryWeight, product of:
                1.0491126 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.017471747 = queryNorm
              0.581187 = fieldWeight in 5808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.0625 = fieldNorm(doc=5808)
          0.016655844 = weight(abstract_txt:sich in 5808) [ClassicSimilarity], result of:
            0.016655844 = score(doc=5808,freq=1.0), product of:
              0.07488687 = queryWeight, product of:
                1.2044489 = boost
                3.5586145 = idf(docFreq=3422, maxDocs=44218)
                0.017471747 = queryNorm
              0.2224134 = fieldWeight in 5808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5586145 = idf(docFreq=3422, maxDocs=44218)
                0.0625 = fieldNorm(doc=5808)
          0.06740525 = weight(abstract_txt:methoden in 5808) [ClassicSimilarity], result of:
            0.06740525 = score(doc=5808,freq=2.0), product of:
              0.13186109 = queryWeight, product of:
                1.3049632 = boost
                5.7833843 = idf(docFreq=369, maxDocs=44218)
                0.017471747 = queryNorm
              0.5111838 = fieldWeight in 5808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7833843 = idf(docFreq=369, maxDocs=44218)
                0.0625 = fieldNorm(doc=5808)
          0.029625319 = weight(abstract_txt:sind in 5808) [ClassicSimilarity], result of:
            0.029625319 = score(doc=5808,freq=1.0), product of:
              0.1209993 = queryWeight, product of:
                1.7678539 = boost
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.017471747 = queryNorm
              0.24483876 = fieldWeight in 5808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.0625 = fieldNorm(doc=5808)
          0.0999901 = weight(abstract_txt:verfahren in 5808) [ClassicSimilarity], result of:
            0.0999901 = score(doc=5808,freq=2.0), product of:
              0.19633117 = queryWeight, product of:
                1.9502046 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.017471747 = queryNorm
              0.509293 = fieldWeight in 5808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.0625 = fieldNorm(doc=5808)
          0.24347508 = weight(abstract_txt:extraktion in 5808) [ClassicSimilarity], result of:
            0.24347508 = score(doc=5808,freq=1.0), product of:
              0.44771084 = queryWeight, product of:
                2.9449937 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.017471747 = queryNorm
              0.54382217 = fieldWeight in 5808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.0625 = fieldNorm(doc=5808)
        0.24 = coord(6/25)