Document (#26672)

Author
Braun, T.
Title
Dokumentklassifikation durch Clustering
Source
http://www.cl-ki.uni-osnabrueck.de/~bikini/Vortraege/Klassifikation/classif.html
Year
o.J.
Abstract
Beim Clustering werden Dokumente aufgrund von Ähnlichkeiten untereinander klassifiziert, im Gegensatz z.B. zur Klassifikation anhand einer Ontologie. Bei den gebräuchlichen Clusteringverfahren wird ein Dokument als die Menge seiner Wörter angesehen. Zur Bestimmung der Ähnlichkeit zwischen Dokumenten werden verschiedene Ähnlichkeitsmaße definiert.
Theme
Automatisches Klassifizieren

Similar documents (author)

  1. Braun, H.: Sacherschließung 1978 (1979) 5.32
    5.3242707 = sum of:
      5.3242707 = weight(author_txt:braun in 1601) [ClassicSimilarity], result of:
        5.3242707 = fieldWeight in 1601, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.518833 = idf(docFreq=23, maxDocs=44218)
          0.625 = fieldNorm(doc=1601)
    
  2. Braun, M.: Aufbau eines Fax-Netzes (1993) 5.32
    5.3242707 = sum of:
      5.3242707 = weight(author_txt:braun in 3029) [ClassicSimilarity], result of:
        5.3242707 = fieldWeight in 3029, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.518833 = idf(docFreq=23, maxDocs=44218)
          0.625 = fieldNorm(doc=3029)
    
  3. Braun, I.: Medienökologie - ein blinder Fleck der Sozialwissenschaften : auch Computernetze sind an der Zerstörung des Planeten beteiligt (1995) 5.32
    5.3242707 = sum of:
      5.3242707 = weight(author_txt:braun in 2107) [ClassicSimilarity], result of:
        5.3242707 = fieldWeight in 2107, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.518833 = idf(docFreq=23, maxDocs=44218)
          0.625 = fieldNorm(doc=2107)
    
  4. Braun, W.: Etymologisches Wörterbuch des Deutschen (1995) 5.32
    5.3242707 = sum of:
      5.3242707 = weight(author_txt:braun in 4356) [ClassicSimilarity], result of:
        5.3242707 = fieldWeight in 4356, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.518833 = idf(docFreq=23, maxDocs=44218)
          0.625 = fieldNorm(doc=4356)
    
  5. Braun, H.: Neuronale Netze : Optimierung durch Lernen und Evolution (1997) 5.32
    5.3242707 = sum of:
      5.3242707 = weight(author_txt:braun in 730) [ClassicSimilarity], result of:
        5.3242707 = fieldWeight in 730, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.518833 = idf(docFreq=23, maxDocs=44218)
          0.625 = fieldNorm(doc=730)
    

Similar documents (content)

  1. Dahmen, E.: Klassifikation als Ordnundssystem im elektronischen Pressearchiv (2003) 0.19
    0.1861708 = sum of:
      0.1861708 = product of:
        0.6648957 = sum of:
          0.013135196 = weight(abstract_txt:durch in 1513) [ClassicSimilarity], result of:
            0.013135196 = score(doc=1513,freq=1.0), product of:
              0.06597406 = queryWeight, product of:
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.015532848 = queryNorm
              0.19909638 = fieldWeight in 1513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.046875 = fieldNorm(doc=1513)
          0.03204816 = weight(abstract_txt:beim in 1513) [ClassicSimilarity], result of:
            0.03204816 = score(doc=1513,freq=1.0), product of:
              0.11956871 = queryWeight, product of:
                1.3462391 = boost
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.015532848 = queryNorm
              0.2680313 = fieldWeight in 1513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.046875 = fieldNorm(doc=1513)
          0.036716472 = weight(abstract_txt:verschiedene in 1513) [ClassicSimilarity], result of:
            0.036716472 = score(doc=1513,freq=1.0), product of:
              0.13091502 = queryWeight, product of:
                1.4086664 = boost
                5.9831543 = idf(docFreq=302, maxDocs=44218)
                0.015532848 = queryNorm
              0.28046036 = fieldWeight in 1513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9831543 = idf(docFreq=302, maxDocs=44218)
                0.046875 = fieldNorm(doc=1513)
          0.34493893 = weight(title_txt:klassifikation in 1513) [ClassicSimilarity], result of:
            0.34493893 = score(doc=1513,freq=1.0), product of:
              0.14571927 = queryWeight, product of:
                1.4861817 = boost
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.015532848 = queryNorm
              2.367147 = fieldWeight in 1513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.375 = fieldNorm(doc=1513)
          0.029557046 = weight(abstract_txt:werden in 1513) [ClassicSimilarity], result of:
            0.029557046 = score(doc=1513,freq=4.0), product of:
              0.089917906 = queryWeight, product of:
                1.6510168 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.015532848 = queryNorm
              0.32871145 = fieldWeight in 1513, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.046875 = fieldNorm(doc=1513)
          0.09904575 = weight(abstract_txt:dokument in 1513) [ClassicSimilarity], result of:
            0.09904575 = score(doc=1513,freq=2.0), product of:
              0.20135516 = queryWeight, product of:
                1.7470076 = boost
                7.4202213 = idf(docFreq=71, maxDocs=44218)
                0.015532848 = queryNorm
              0.4918958 = fieldWeight in 1513, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4202213 = idf(docFreq=71, maxDocs=44218)
                0.046875 = fieldNorm(doc=1513)
          0.10945415 = weight(abstract_txt:wörter in 1513) [ClassicSimilarity], result of:
            0.10945415 = score(doc=1513,freq=2.0), product of:
              0.21522546 = queryWeight, product of:
                1.8061767 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.015532848 = queryNorm
              0.50855577 = fieldWeight in 1513, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.046875 = fieldNorm(doc=1513)
        0.28 = coord(7/25)
    
  2. Probst, M.; Mittelbach, J.: Maschinelle Indexierung in der Sacherschließung wissenschaftlicher Bibliotheken (2006) 0.18
    0.1766297 = sum of:
      0.1766297 = product of:
        0.73595715 = sum of:
          0.03064879 = weight(abstract_txt:durch in 1755) [ClassicSimilarity], result of:
            0.03064879 = score(doc=1755,freq=1.0), product of:
              0.06597406 = queryWeight, product of:
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.015532848 = queryNorm
              0.4645582 = fieldWeight in 1755, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.109375 = fieldNorm(doc=1755)
          0.07477903 = weight(abstract_txt:beim in 1755) [ClassicSimilarity], result of:
            0.07477903 = score(doc=1755,freq=1.0), product of:
              0.11956871 = queryWeight, product of:
                1.3462391 = boost
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.015532848 = queryNorm
              0.6254064 = fieldWeight in 1755, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.109375 = fieldNorm(doc=1755)
          0.10706261 = weight(abstract_txt:dokumenten in 1755) [ClassicSimilarity], result of:
            0.10706261 = score(doc=1755,freq=1.0), product of:
              0.15188779 = queryWeight, product of:
                1.5173118 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.015532848 = queryNorm
              0.70487964 = fieldWeight in 1755, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.109375 = fieldNorm(doc=1755)
          0.03448322 = weight(abstract_txt:werden in 1755) [ClassicSimilarity], result of:
            0.03448322 = score(doc=1755,freq=1.0), product of:
              0.089917906 = queryWeight, product of:
                1.6510168 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.015532848 = queryNorm
              0.3834967 = fieldWeight in 1755, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.109375 = fieldNorm(doc=1755)
          0.15736343 = weight(abstract_txt:gegensatz in 1755) [ClassicSimilarity], result of:
            0.15736343 = score(doc=1755,freq=1.0), product of:
              0.19635119 = queryWeight, product of:
                1.7251631 = boost
                7.3274393 = idf(docFreq=78, maxDocs=44218)
                0.015532848 = queryNorm
              0.8014387 = fieldWeight in 1755, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3274393 = idf(docFreq=78, maxDocs=44218)
                0.109375 = fieldNorm(doc=1755)
          0.33162 = weight(abstract_txt:gebräuchlichen in 1755) [ClassicSimilarity], result of:
            0.33162 = score(doc=1755,freq=1.0), product of:
              0.32274395 = queryWeight, product of:
                2.2117827 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.015532848 = queryNorm
              1.0275018 = fieldWeight in 1755, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.109375 = fieldNorm(doc=1755)
        0.24 = coord(6/25)
    
  3. Fuhr, N.: Theorie des Information Retrieval I : Modelle (2004) 0.15
    0.14765933 = sum of:
      0.14765933 = product of:
        0.5273548 = sum of:
          0.04157736 = weight(abstract_txt:zwischen in 2912) [ClassicSimilarity], result of:
            0.04157736 = score(doc=2912,freq=2.0), product of:
              0.093186066 = queryWeight, product of:
                1.1884718 = boost
                5.0479026 = idf(docFreq=771, maxDocs=44218)
                0.015532848 = queryNorm
              0.44617575 = fieldWeight in 2912, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0479026 = idf(docFreq=771, maxDocs=44218)
                0.0625 = fieldNorm(doc=2912)
          0.055921715 = weight(abstract_txt:dokumente in 2912) [ClassicSimilarity], result of:
            0.055921715 = score(doc=2912,freq=1.0), product of:
              0.14305729 = queryWeight, product of:
                1.4725444 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.015532848 = queryNorm
              0.39090434 = fieldWeight in 2912, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.0625 = fieldNorm(doc=2912)
          0.105964504 = weight(abstract_txt:dokumenten in 2912) [ClassicSimilarity], result of:
            0.105964504 = score(doc=2912,freq=3.0), product of:
              0.15188779 = queryWeight, product of:
                1.5173118 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.015532848 = queryNorm
              0.6976499 = fieldWeight in 2912, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.0625 = fieldNorm(doc=2912)
          0.039409395 = weight(abstract_txt:werden in 2912) [ClassicSimilarity], result of:
            0.039409395 = score(doc=2912,freq=4.0), product of:
              0.089917906 = queryWeight, product of:
                1.6510168 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.015532848 = queryNorm
              0.43828195 = fieldWeight in 2912, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0625 = fieldNorm(doc=2912)
          0.11157369 = weight(abstract_txt:menge in 2912) [ClassicSimilarity], result of:
            0.11157369 = score(doc=2912,freq=2.0), product of:
              0.17995097 = queryWeight, product of:
                1.6515454 = boost
                7.014756 = idf(docFreq=107, maxDocs=44218)
                0.015532848 = queryNorm
              0.6200227 = fieldWeight in 2912, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.014756 = idf(docFreq=107, maxDocs=44218)
                0.0625 = fieldNorm(doc=2912)
          0.07952689 = weight(abstract_txt:definiert in 2912) [ClassicSimilarity], result of:
            0.07952689 = score(doc=2912,freq=1.0), product of:
              0.18091129 = queryWeight, product of:
                1.6559463 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.015532848 = queryNorm
              0.4395905 = fieldWeight in 2912, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=2912)
          0.09338124 = weight(abstract_txt:dokument in 2912) [ClassicSimilarity], result of:
            0.09338124 = score(doc=2912,freq=1.0), product of:
              0.20135516 = queryWeight, product of:
                1.7470076 = boost
                7.4202213 = idf(docFreq=71, maxDocs=44218)
                0.015532848 = queryNorm
              0.46376383 = fieldWeight in 2912, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4202213 = idf(docFreq=71, maxDocs=44218)
                0.0625 = fieldNorm(doc=2912)
        0.28 = coord(7/25)
    
  4. Schek, M.: Automatische Klassifizierung und Visualisierung im Archiv der Süddeutschen Zeitung (2005) 0.14
    0.1387258 = sum of:
      0.1387258 = product of:
        0.38534945 = sum of:
          0.021891993 = weight(abstract_txt:durch in 4884) [ClassicSimilarity], result of:
            0.021891993 = score(doc=4884,freq=4.0), product of:
              0.06597406 = queryWeight, product of:
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.015532848 = queryNorm
              0.33182728 = fieldWeight in 4884, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4884)
          0.018374773 = weight(abstract_txt:zwischen in 4884) [ClassicSimilarity], result of:
            0.018374773 = score(doc=4884,freq=1.0), product of:
              0.093186066 = queryWeight, product of:
                1.1884718 = boost
                5.0479026 = idf(docFreq=771, maxDocs=44218)
                0.015532848 = queryNorm
              0.1971837 = fieldWeight in 4884, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0479026 = idf(docFreq=771, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4884)
          0.026358018 = weight(abstract_txt:anhand in 4884) [ClassicSimilarity], result of:
            0.026358018 = score(doc=4884,freq=1.0), product of:
              0.118525416 = queryWeight, product of:
                1.3403529 = boost
                5.6930003 = idf(docFreq=404, maxDocs=44218)
                0.015532848 = queryNorm
              0.22238283 = fieldWeight in 4884, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6930003 = idf(docFreq=404, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4884)
          0.021330962 = weight(abstract_txt:werden in 4884) [ClassicSimilarity], result of:
            0.021330962 = score(doc=4884,freq=3.0), product of:
              0.089917906 = queryWeight, product of:
                1.6510168 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.015532848 = queryNorm
              0.23722707 = fieldWeight in 4884, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4884)
          0.049704302 = weight(abstract_txt:definiert in 4884) [ClassicSimilarity], result of:
            0.049704302 = score(doc=4884,freq=1.0), product of:
              0.18091129 = queryWeight, product of:
                1.6559463 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.015532848 = queryNorm
              0.27474406 = fieldWeight in 4884, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4884)
          0.056201223 = weight(abstract_txt:gegensatz in 4884) [ClassicSimilarity], result of:
            0.056201223 = score(doc=4884,freq=1.0), product of:
              0.19635119 = queryWeight, product of:
                1.7251631 = boost
                7.3274393 = idf(docFreq=78, maxDocs=44218)
                0.015532848 = queryNorm
              0.2862281 = fieldWeight in 4884, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3274393 = idf(docFreq=78, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4884)
          0.058363274 = weight(abstract_txt:dokument in 4884) [ClassicSimilarity], result of:
            0.058363274 = score(doc=4884,freq=1.0), product of:
              0.20135516 = queryWeight, product of:
                1.7470076 = boost
                7.4202213 = idf(docFreq=71, maxDocs=44218)
                0.015532848 = queryNorm
              0.28985238 = fieldWeight in 4884, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4202213 = idf(docFreq=71, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4884)
          0.06449647 = weight(abstract_txt:untereinander in 4884) [ClassicSimilarity], result of:
            0.06449647 = score(doc=4884,freq=1.0), product of:
              0.21522546 = queryWeight, product of:
                1.8061767 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.015532848 = queryNorm
              0.29966936 = fieldWeight in 4884, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4884)
          0.068628445 = weight(abstract_txt:clustering in 4884) [ClassicSimilarity], result of:
            0.068628445 = score(doc=4884,freq=1.0), product of:
              0.2826284 = queryWeight, product of:
                2.9270914 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.015532848 = queryNorm
              0.2428222 = fieldWeight in 4884, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4884)
        0.36 = coord(9/25)
    
  5. Güntner, G.; Sint, R.; Westenthaler, R.: ¬Ein Ansatz zur Unterstützung traditioneller Klassifikation durch Social Tagging (2008) 0.13
    0.13385512 = sum of:
      0.13385512 = product of:
        0.6692756 = sum of:
          0.026270391 = weight(abstract_txt:durch in 2897) [ClassicSimilarity], result of:
            0.026270391 = score(doc=2897,freq=1.0), product of:
              0.06597406 = queryWeight, product of:
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.015532848 = queryNorm
              0.39819276 = fieldWeight in 2897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.09375 = fieldNorm(doc=2897)
          0.08388257 = weight(abstract_txt:dokumente in 2897) [ClassicSimilarity], result of:
            0.08388257 = score(doc=2897,freq=1.0), product of:
              0.14305729 = queryWeight, product of:
                1.4725444 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.015532848 = queryNorm
              0.5863565 = fieldWeight in 2897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.09375 = fieldNorm(doc=2897)
          0.28744915 = weight(title_txt:klassifikation in 2897) [ClassicSimilarity], result of:
            0.28744915 = score(doc=2897,freq=1.0), product of:
              0.14571927 = queryWeight, product of:
                1.4861817 = boost
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.015532848 = queryNorm
              1.9726226 = fieldWeight in 2897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.3125 = fieldNorm(doc=2897)
          0.041799977 = weight(abstract_txt:werden in 2897) [ClassicSimilarity], result of:
            0.041799977 = score(doc=2897,freq=2.0), product of:
              0.089917906 = queryWeight, product of:
                1.6510168 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.015532848 = queryNorm
              0.46486822 = fieldWeight in 2897, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.09375 = fieldNorm(doc=2897)
          0.2298735 = weight(abstract_txt:klassifiziert in 2897) [ClassicSimilarity], result of:
            0.2298735 = score(doc=2897,freq=1.0), product of:
              0.28014836 = queryWeight, product of:
                2.0606654 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.015532848 = queryNorm
              0.820542 = fieldWeight in 2897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.09375 = fieldNorm(doc=2897)
        0.2 = coord(5/25)