Document (#32319)

Author
Kaufmann, E.
Title
¬Das Indexieren von natürlichsprachlichen Dokumenten und die inverse Seitenhäufigkeit
Source
http://www.ifi.unizh.ch/cl/study/lizarbeiten/lizkaufmann.pdf
Year
2001
Abstract
Die Lizentiatsarbeit gibt im ersten theoretischen Teil einen Überblick über das Indexieren von Dokumenten. Sie zeigt die verschiedenen Typen von Indexen sowie die wichtigsten Aspekte bezüglich einer Indexsprache auf. Diverse manuelle und automatische Indexierungsverfahren werden präsentiert. Spezielle Aufmerksamkeit innerhalb des ersten Teils gilt den Schlagwortregistern, deren charakteristische Merkmale und Eigenheiten erörtert werden. Zusätzlich werden die gängigen Kriterien zur Bewertung von Indexen sowie die Masse zur Evaluation von Indexierungsverfahren und Indexierungsergebnissen vorgestellt. Im zweiten Teil der Arbeit werden fünf reale Bücher einer statistischen Untersuchung unterzogen. Zum einen werden die lexikalischen und syntaktischen Bestandteile der fünf Buchregister ermittelt, um den Inhalt von Schlagwortregistern zu erschliessen. Andererseits werden aus den Textausschnitten der Bücher Indexterme maschinell extrahiert und mit den Schlagworteinträgen in den Buchregistern verglichen. Das Hauptziel der Untersuchungen besteht darin, eine Indexierungsmethode, die auf linguistikorientierter Extraktion der Indexterme und Termhäufigkeitsgewichtung basiert, im Hinblick auf ihren Gebrauchswert für eine automatische Indexierung zu testen. Die Gewichtungsmethode ist die inverse Seitenhäufigkeit, eine Methode, welche von der inversen Dokumentfrequenz abgeleitet wurde, zur automatischen Erstellung von Schlagwortregistern für deutschsprachige Texte. Die Prüfung der Methode im statistischen Teil führte nicht zu zufriedenstellenden Resultaten.
Content
Lizentiatsarbeit der Philosphischen Fakultät der Universität Zürich, - Vgl. auch: http://www.ifi.unizh.ch/cl/study/lizarbeiten/lizkaufmann.pdf.
Theme
Automatisches Indexieren
Register

Similar documents (author)

  1. Kaufmann, N.C.: Kommt das Domainsterben? : Rechtsprechung gibt beschreibende Internet-Adressen zum Abschuss frei (2000) 5.97
    5.972007 = sum of:
      5.972007 = weight(author_txt:kaufmann in 5756) [ClassicSimilarity], result of:
        5.972007 = fieldWeight in 5756, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.555211 = idf(docFreq=7, maxDocs=41550)
          0.625 = fieldNorm(doc=5756)
    
  2. Kaufmann, T.: Googeln wie die Profis : Perfekte Suche (2004) 5.97
    5.972007 = sum of:
      5.972007 = weight(author_txt:kaufmann in 2926) [ClassicSimilarity], result of:
        5.972007 = fieldWeight in 2926, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.555211 = idf(docFreq=7, maxDocs=41550)
          0.625 = fieldNorm(doc=2926)
    
  3. Havemann, F.; Kaufmann, A.: ¬Der Wandel des Benutzerverhaltens in Zeiten des Internet : Ergebnisse von Befragungen an 13 Bibliotheken (2006) 4.78
    4.7776055 = sum of:
      4.7776055 = weight(author_txt:kaufmann in 1149) [ClassicSimilarity], result of:
        4.7776055 = fieldWeight in 1149, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.555211 = idf(docFreq=7, maxDocs=41550)
          0.5 = fieldNorm(doc=1149)
    
  4. Kaufmann, J.-C.: Wenn ICH ein anderer ist (2010) 4.78
    4.7776055 = sum of:
      4.7776055 = weight(author_txt:kaufmann in 638) [ClassicSimilarity], result of:
        4.7776055 = fieldWeight in 638, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.555211 = idf(docFreq=7, maxDocs=41550)
          0.5 = fieldNorm(doc=638)
    
  5. Kaufmann, J.-C.: ¬Die Erfindung des Ich : eine Theorie der Identität (2005) 4.78
    4.7776055 = sum of:
      4.7776055 = weight(author_txt:kaufmann in 639) [ClassicSimilarity], result of:
        4.7776055 = fieldWeight in 639, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.555211 = idf(docFreq=7, maxDocs=41550)
          0.5 = fieldNorm(doc=639)
    

Similar documents (content)

  1. Halip, I.: Automatische Extrahierung von Schlagworten aus unstrukturierten Texten (2005) 0.19
    0.190426 = sum of:
      0.190426 = product of:
        0.59508127 = sum of:
          0.07580451 = weight(abstract_txt:manuelle in 1986) [ClassicSimilarity], result of:
            0.07580451 = score(doc=1986,freq=1.0), product of:
              0.1574901 = queryWeight, product of:
                1.0065367 = boost
                8.801439 = idf(docFreq=16, maxDocs=41550)
                0.01777747 = queryNorm
              0.48132873 = fieldWeight in 1986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.801439 = idf(docFreq=16, maxDocs=41550)
                0.0546875 = fieldNorm(doc=1986)
          0.022343218 = weight(abstract_txt:sowie in 1986) [ClassicSimilarity], result of:
            0.022343218 = score(doc=1986,freq=1.0), product of:
              0.087881416 = queryWeight, product of:
                1.0633261 = boost
                4.649011 = idf(docFreq=1080, maxDocs=41550)
                0.01777747 = queryNorm
              0.2542428 = fieldWeight in 1986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.649011 = idf(docFreq=1080, maxDocs=41550)
                0.0546875 = fieldNorm(doc=1986)
          0.020876553 = weight(abstract_txt:eine in 1986) [ClassicSimilarity], result of:
            0.020876553 = score(doc=1986,freq=2.0), product of:
              0.076312006 = queryWeight, product of:
                1.2135565 = boost
                3.5372264 = idf(docFreq=3285, maxDocs=41550)
                0.01777747 = queryNorm
              0.2735684 = fieldWeight in 1986, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5372264 = idf(docFreq=3285, maxDocs=41550)
                0.0546875 = fieldNorm(doc=1986)
          0.10321707 = weight(abstract_txt:dokumenten in 1986) [ClassicSimilarity], result of:
            0.10321707 = score(doc=1986,freq=3.0), product of:
              0.16901575 = queryWeight, product of:
                1.4746249 = boost
                6.447267 = idf(docFreq=178, maxDocs=41550)
                0.01777747 = queryNorm
              0.610695 = fieldWeight in 1986, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.447267 = idf(docFreq=178, maxDocs=41550)
                0.0546875 = fieldNorm(doc=1986)
          0.0741392 = weight(abstract_txt:automatische in 1986) [ClassicSimilarity], result of:
            0.0741392 = score(doc=1986,freq=1.0), product of:
              0.1955083 = queryWeight, product of:
                1.5859904 = boost
                6.9341726 = idf(docFreq=109, maxDocs=41550)
                0.01777747 = queryNorm
              0.37921256 = fieldWeight in 1986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9341726 = idf(docFreq=109, maxDocs=41550)
                0.0546875 = fieldNorm(doc=1986)
          0.056608234 = weight(abstract_txt:teil in 1986) [ClassicSimilarity], result of:
            0.056608234 = score(doc=1986,freq=1.0), product of:
              0.1869606 = queryWeight, product of:
                1.899497 = boost
                5.5365787 = idf(docFreq=444, maxDocs=41550)
                0.01777747 = queryNorm
              0.30278164 = fieldWeight in 1986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5365787 = idf(docFreq=444, maxDocs=41550)
                0.0546875 = fieldNorm(doc=1986)
          0.17032996 = weight(abstract_txt:indexierungsverfahren in 1986) [ClassicSimilarity], result of:
            0.17032996 = score(doc=1986,freq=1.0), product of:
              0.34040347 = queryWeight, product of:
                2.0927384 = boost
                9.149746 = idf(docFreq=11, maxDocs=41550)
                0.01777747 = queryNorm
              0.5003767 = fieldWeight in 1986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.149746 = idf(docFreq=11, maxDocs=41550)
                0.0546875 = fieldNorm(doc=1986)
          0.071762525 = weight(abstract_txt:werden in 1986) [ClassicSimilarity], result of:
            0.071762525 = score(doc=1986,freq=6.0), product of:
              0.15184076 = queryWeight, product of:
                2.4208772 = boost
                3.5281384 = idf(docFreq=3315, maxDocs=41550)
                0.01777747 = queryNorm
              0.472617 = fieldWeight in 1986, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.5281384 = idf(docFreq=3315, maxDocs=41550)
                0.0546875 = fieldNorm(doc=1986)
        0.32 = coord(8/25)
    
  2. Bredack, J.: Automatische Extraktion fachterminologischer Mehrwortbegriffe : ein Verfahrensvergleich (2016) 0.15
    0.15259627 = sum of:
      0.15259627 = product of:
        0.63581777 = sum of:
          0.094799295 = weight(abstract_txt:extrahiert in 5194) [ClassicSimilarity], result of:
            0.094799295 = score(doc=5194,freq=1.0), product of:
              0.16723686 = queryWeight, product of:
                1.0372155 = boost
                9.069703 = idf(docFreq=12, maxDocs=41550)
                0.01777747 = queryNorm
              0.56685644 = fieldWeight in 5194, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.069703 = idf(docFreq=12, maxDocs=41550)
                0.0625 = fieldNorm(doc=5194)
          0.023858918 = weight(abstract_txt:eine in 5194) [ClassicSimilarity], result of:
            0.023858918 = score(doc=5194,freq=2.0), product of:
              0.076312006 = queryWeight, product of:
                1.2135565 = boost
                3.5372264 = idf(docFreq=3285, maxDocs=41550)
                0.01777747 = queryNorm
              0.3126496 = fieldWeight in 5194, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5372264 = idf(docFreq=3285, maxDocs=41550)
                0.0625 = fieldNorm(doc=5194)
          0.08473052 = weight(abstract_txt:automatische in 5194) [ClassicSimilarity], result of:
            0.08473052 = score(doc=5194,freq=1.0), product of:
              0.1955083 = queryWeight, product of:
                1.5859904 = boost
                6.9341726 = idf(docFreq=109, maxDocs=41550)
                0.01777747 = queryNorm
              0.4333858 = fieldWeight in 5194, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9341726 = idf(docFreq=109, maxDocs=41550)
                0.0625 = fieldNorm(doc=5194)
          0.1287091 = weight(abstract_txt:statistischen in 5194) [ClassicSimilarity], result of:
            0.1287091 = score(doc=5194,freq=1.0), product of:
              0.25835177 = queryWeight, product of:
                1.8231554 = boost
                7.9710913 = idf(docFreq=38, maxDocs=41550)
                0.01777747 = queryNorm
              0.4981932 = fieldWeight in 5194, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9710913 = idf(docFreq=38, maxDocs=41550)
                0.0625 = fieldNorm(doc=5194)
          0.22170565 = weight(abstract_txt:indexterme in 5194) [ClassicSimilarity], result of:
            0.22170565 = score(doc=5194,freq=1.0), product of:
              0.37124145 = queryWeight, product of:
                2.1854768 = boost
                9.555211 = idf(docFreq=7, maxDocs=41550)
                0.01777747 = queryNorm
              0.5972007 = fieldWeight in 5194, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.555211 = idf(docFreq=7, maxDocs=41550)
                0.0625 = fieldNorm(doc=5194)
          0.082014315 = weight(abstract_txt:werden in 5194) [ClassicSimilarity], result of:
            0.082014315 = score(doc=5194,freq=6.0), product of:
              0.15184076 = queryWeight, product of:
                2.4208772 = boost
                3.5281384 = idf(docFreq=3315, maxDocs=41550)
                0.01777747 = queryNorm
              0.5401337 = fieldWeight in 5194, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.5281384 = idf(docFreq=3315, maxDocs=41550)
                0.0625 = fieldNorm(doc=5194)
        0.24 = coord(6/25)
    
  3. Leonhardt, H.A.: Systematik "Ästhetische Kulturwissenschaft" an der Universitätsbibliothek Hildesheim : ein Innovationsbericht (2018) 0.13
    0.13191198 = sum of:
      0.13191198 = product of:
        0.65955985 = sum of:
          0.033741605 = weight(abstract_txt:eine in 490) [ClassicSimilarity], result of:
            0.033741605 = score(doc=490,freq=1.0), product of:
              0.076312006 = queryWeight, product of:
                1.2135565 = boost
                3.5372264 = idf(docFreq=3285, maxDocs=41550)
                0.01777747 = queryNorm
              0.4421533 = fieldWeight in 490, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5372264 = idf(docFreq=3285, maxDocs=41550)
                0.125 = fieldNorm(doc=490)
          0.15142104 = weight(abstract_txt:bücher in 490) [ClassicSimilarity], result of:
            0.15142104 = score(doc=490,freq=1.0), product of:
              0.18137445 = queryWeight, product of:
                1.5275872 = boost
                6.678826 = idf(docFreq=141, maxDocs=41550)
                0.01777747 = queryNorm
              0.83485323 = fieldWeight in 490, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.678826 = idf(docFreq=141, maxDocs=41550)
                0.125 = fieldNorm(doc=490)
          0.250305 = weight(abstract_txt:indexieren in 490) [ClassicSimilarity], result of:
            0.250305 = score(doc=490,freq=1.0), product of:
              0.25357026 = queryWeight, product of:
                1.8062053 = boost
                7.896983 = idf(docFreq=41, maxDocs=41550)
                0.01777747 = queryNorm
              0.9871229 = fieldWeight in 490, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.896983 = idf(docFreq=41, maxDocs=41550)
                0.125 = fieldNorm(doc=490)
          0.12939025 = weight(abstract_txt:teil in 490) [ClassicSimilarity], result of:
            0.12939025 = score(doc=490,freq=1.0), product of:
              0.1869606 = queryWeight, product of:
                1.899497 = boost
                5.5365787 = idf(docFreq=444, maxDocs=41550)
                0.01777747 = queryNorm
              0.69207233 = fieldWeight in 490, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5365787 = idf(docFreq=444, maxDocs=41550)
                0.125 = fieldNorm(doc=490)
          0.09470196 = weight(abstract_txt:werden in 490) [ClassicSimilarity], result of:
            0.09470196 = score(doc=490,freq=2.0), product of:
              0.15184076 = queryWeight, product of:
                2.4208772 = boost
                3.5281384 = idf(docFreq=3315, maxDocs=41550)
                0.01777747 = queryNorm
              0.62369263 = fieldWeight in 490, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5281384 = idf(docFreq=3315, maxDocs=41550)
                0.125 = fieldNorm(doc=490)
        0.2 = coord(5/25)
    
  4. Larroche-Boutet, V.; Pöhl, K.: ¬Das Nominalsyntagna : über die Nutzbarmachung eines logico-semantischen Konzeptes für dokumentarische Fragestellungen (1993) 0.12
    0.11772972 = sum of:
      0.11772972 = product of:
        0.5886486 = sum of:
          0.094799295 = weight(abstract_txt:extrahiert in 6760) [ClassicSimilarity], result of:
            0.094799295 = score(doc=6760,freq=1.0), product of:
              0.16723686 = queryWeight, product of:
                1.0372155 = boost
                9.069703 = idf(docFreq=12, maxDocs=41550)
                0.01777747 = queryNorm
              0.56685644 = fieldWeight in 6760, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.069703 = idf(docFreq=12, maxDocs=41550)
                0.0625 = fieldNorm(doc=6760)
          0.023858918 = weight(abstract_txt:eine in 6760) [ClassicSimilarity], result of:
            0.023858918 = score(doc=6760,freq=2.0), product of:
              0.076312006 = queryWeight, product of:
                1.2135565 = boost
                3.5372264 = idf(docFreq=3285, maxDocs=41550)
                0.01777747 = queryNorm
              0.3126496 = fieldWeight in 6760, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5372264 = idf(docFreq=3285, maxDocs=41550)
                0.0625 = fieldNorm(doc=6760)
          0.119827054 = weight(abstract_txt:automatische in 6760) [ClassicSimilarity], result of:
            0.119827054 = score(doc=6760,freq=2.0), product of:
              0.1955083 = queryWeight, product of:
                1.5859904 = boost
                6.9341726 = idf(docFreq=109, maxDocs=41550)
                0.01777747 = queryNorm
              0.6129001 = fieldWeight in 6760, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9341726 = idf(docFreq=109, maxDocs=41550)
                0.0625 = fieldNorm(doc=6760)
          0.2752948 = weight(abstract_txt:indexierungsverfahren in 6760) [ClassicSimilarity], result of:
            0.2752948 = score(doc=6760,freq=2.0), product of:
              0.34040347 = queryWeight, product of:
                2.0927384 = boost
                9.149746 = idf(docFreq=11, maxDocs=41550)
                0.01777747 = queryNorm
              0.8087309 = fieldWeight in 6760, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.149746 = idf(docFreq=11, maxDocs=41550)
                0.0625 = fieldNorm(doc=6760)
          0.07486848 = weight(abstract_txt:werden in 6760) [ClassicSimilarity], result of:
            0.07486848 = score(doc=6760,freq=5.0), product of:
              0.15184076 = queryWeight, product of:
                2.4208772 = boost
                3.5281384 = idf(docFreq=3315, maxDocs=41550)
                0.01777747 = queryNorm
              0.49307233 = fieldWeight in 6760, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.5281384 = idf(docFreq=3315, maxDocs=41550)
                0.0625 = fieldNorm(doc=6760)
        0.2 = coord(5/25)
    
  5. Peters, G.; Gaese, V.: ¬Das DocCat-System in der Textdokumentation von G+J (2003) 0.11
    0.11202664 = sum of:
      0.11202664 = product of:
        0.40009513 = sum of:
          0.03578838 = weight(abstract_txt:eine in 2507) [ClassicSimilarity], result of:
            0.03578838 = score(doc=2507,freq=8.0), product of:
              0.076312006 = queryWeight, product of:
                1.2135565 = boost
                3.5372264 = idf(docFreq=3285, maxDocs=41550)
                0.01777747 = queryNorm
              0.4689744 = fieldWeight in 2507, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.5372264 = idf(docFreq=3285, maxDocs=41550)
                0.046875 = fieldNorm(doc=2507)
          0.051079202 = weight(abstract_txt:dokumenten in 2507) [ClassicSimilarity], result of:
            0.051079202 = score(doc=2507,freq=1.0), product of:
              0.16901575 = queryWeight, product of:
                1.4746249 = boost
                6.447267 = idf(docFreq=178, maxDocs=41550)
                0.01777747 = queryNorm
              0.30221564 = fieldWeight in 2507, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.447267 = idf(docFreq=178, maxDocs=41550)
                0.046875 = fieldNorm(doc=2507)
          0.06354789 = weight(abstract_txt:automatische in 2507) [ClassicSimilarity], result of:
            0.06354789 = score(doc=2507,freq=1.0), product of:
              0.1955083 = queryWeight, product of:
                1.5859904 = boost
                6.9341726 = idf(docFreq=109, maxDocs=41550)
                0.01777747 = queryNorm
              0.32503933 = fieldWeight in 2507, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9341726 = idf(docFreq=109, maxDocs=41550)
                0.046875 = fieldNorm(doc=2507)
          0.06379929 = weight(abstract_txt:methode in 2507) [ClassicSimilarity], result of:
            0.06379929 = score(doc=2507,freq=1.0), product of:
              0.1960236 = queryWeight, product of:
                1.5880791 = boost
                6.943305 = idf(docFreq=108, maxDocs=41550)
                0.01777747 = queryNorm
              0.3254674 = fieldWeight in 2507, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.943305 = idf(docFreq=108, maxDocs=41550)
                0.046875 = fieldNorm(doc=2507)
          0.093864374 = weight(abstract_txt:indexieren in 2507) [ClassicSimilarity], result of:
            0.093864374 = score(doc=2507,freq=1.0), product of:
              0.25357026 = queryWeight, product of:
                1.8062053 = boost
                7.896983 = idf(docFreq=41, maxDocs=41550)
                0.01777747 = queryNorm
              0.37017107 = fieldWeight in 2507, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.896983 = idf(docFreq=41, maxDocs=41550)
                0.046875 = fieldNorm(doc=2507)
          0.048521344 = weight(abstract_txt:teil in 2507) [ClassicSimilarity], result of:
            0.048521344 = score(doc=2507,freq=1.0), product of:
              0.1869606 = queryWeight, product of:
                1.899497 = boost
                5.5365787 = idf(docFreq=444, maxDocs=41550)
                0.01777747 = queryNorm
              0.25952712 = fieldWeight in 2507, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5365787 = idf(docFreq=444, maxDocs=41550)
                0.046875 = fieldNorm(doc=2507)
          0.043494653 = weight(abstract_txt:werden in 2507) [ClassicSimilarity], result of:
            0.043494653 = score(doc=2507,freq=3.0), product of:
              0.15184076 = queryWeight, product of:
                2.4208772 = boost
                3.5281384 = idf(docFreq=3315, maxDocs=41550)
                0.01777747 = queryNorm
              0.28644913 = fieldWeight in 2507, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5281384 = idf(docFreq=3315, maxDocs=41550)
                0.046875 = fieldNorm(doc=2507)
        0.28 = coord(7/25)