Document (#37474)

Author
Wartena, C.
Sommer, M.
Title
Automatic classification of scientific records using the German Subject Heading Authority File (SWD)
Source
Proceedings of the 2nd International Workshop on Semantic Digital Archives held in conjunction with the 16th Int. Conference on Theory and Practice of Digital Libraries (TPDL) on September 27, 2012 in Paphos, Cyprus [http://ceur-ws.org/Vol-912/proceedings.pdf]. Eds.: A. Mitschik et al
Year
2012
Pages
pp. 37-48
Abstract
This paper presents an automatic text classification method that does not require training documents. The method uses the German Subject Heading Authority File (SWD), provided by the linked data service of the German National Library. The SWD was recently enriched with notations of the Dewey Decimal Classification (DDC). As a consequence, it became possible to use the subject headings as textual representations of the DDC notations. In essence, we derive the classification of a text from the classification of the words in the text, as given by the thesaurus. The method was tested by classifying 3826 OAI records from 7 different repositories. Mean reciprocal rank and recall were chosen as evaluation measures. A direct comparison with a machine learning method has shown that this method is competitive. We therefore conclude that the enriched version of the SWD provides high-quality information with broad coverage for the classification of German scientific articles.
Content
This work is partially based on the Bachelor thesis of Maike Sommer. See also: http://sda2012.dke-research.de.
Theme
Automatic classification
Object
DDC
SWD
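
The idea summarized in the abstract (deriving a text's classes from the classes of its words, then evaluating with mean reciprocal rank) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy thesaurus, the simple vote counting, and the function names `rank_classes` and `mean_reciprocal_rank` are assumptions made purely for illustration, and the real SWD/DDC mappings are not reproduced.

```python
from collections import Counter

# Hypothetical toy excerpt: subject heading (lower-cased) -> DDC notations.
# These entries are illustrative assumptions, not actual SWD data.
TOY_THESAURUS = {
    "bibliothek": ["020"],
    "klassifikation": ["020", "000"],
    "algorithmus": ["000"],
    "physik": ["530"],
}


def rank_classes(text, thesaurus):
    """Rank DDC notations by how many words of the text map to them."""
    votes = Counter()
    for word in text.lower().split():
        for notation in thesaurus.get(word, []):
            votes[notation] += 1
    # Sort by descending vote count, then by notation for a stable order.
    ordered = sorted(votes.items(), key=lambda item: (-item[1], item[0]))
    return [notation for notation, _ in ordered]


def mean_reciprocal_rank(rankings, gold):
    """Average of 1/rank of the correct class; contributes 0 if it is absent."""
    total = 0.0
    for ranked, correct in zip(rankings, gold):
        if correct in ranked:
            total += 1.0 / (ranked.index(correct) + 1)
    return total / len(gold)


rankings = [
    rank_classes("Bibliothek Klassifikation", TOY_THESAURUS),
    rank_classes("Algorithmus Physik Algorithmus", TOY_THESAURUS),
]
print(rankings)
print(mean_reciprocal_rank(rankings, ["020", "530"]))
```

A real system would presumably need normalization of inflected German word forms and some weighting of ambiguous headings; the sketch leaves both out.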

Similar documents (author)

  1. Sommer, F.T.: Theorie neuronaler Assoziativspeicher : lokales Lernen und iteratives Retrieval von Information (1994) 6.00
    5.9971275 = sum of:
      5.9971275 = weight(author_txt:sommer in 4170) [ClassicSimilarity], result of:
        5.9971275 = fieldWeight in 4170, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.595404 = idf(docFreq=7, maxDocs=43254)
          0.625 = fieldNorm(doc=4170)
    
  2. Sommer, D.: Zwölf Jahre Projektarbeit am VD 17 in der Universitäts- und Landesbibliothek Sachsen-Anhalt - eine Bilanz (2008) 6.00
    5.9971275 = sum of:
      5.9971275 = weight(author_txt:sommer in 4337) [ClassicSimilarity], result of:
        5.9971275 = fieldWeight in 4337, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.595404 = idf(docFreq=7, maxDocs=43254)
          0.625 = fieldNorm(doc=4337)
    
  3. Sommer, M.: Automatische Generierung von DDC-Notationen für Hochschulveröffentlichungen (2012) 6.00
    5.9971275 = sum of:
      5.9971275 = weight(author_txt:sommer in 2052) [ClassicSimilarity], result of:
        5.9971275 = fieldWeight in 2052, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.595404 = idf(docFreq=7, maxDocs=43254)
          0.625 = fieldNorm(doc=2052)
    
  4. Sommer, D.: VD16, VD17, VD18 : Diversität und Integration (2010) 6.00
    5.9971275 = sum of:
      5.9971275 = weight(author_txt:sommer in 4332) [ClassicSimilarity], result of:
        5.9971275 = fieldWeight in 4332, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.595404 = idf(docFreq=7, maxDocs=43254)
          0.625 = fieldNorm(doc=4332)
    
  5. Sommer, D.; Schöning-Walter, C.; Heiligenhaus, K.: URN Granular : persistente Identifizierung und Adressierung von Einzelseiten digitalisierter Drucke (2008) 3.60
    3.5982764 = sum of:
      3.5982764 = weight(author_txt:sommer in 394) [ClassicSimilarity], result of:
        3.5982764 = fieldWeight in 394, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.595404 = idf(docFreq=7, maxDocs=43254)
          0.375 = fieldNorm(doc=394)
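
The author scores above are each a single fieldWeight, the product of tf, idf and fieldNorm. As a minimal sketch, the arithmetic can be reproduced in Python. The formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are consistent with the values listed; the function names are assumptions, and plain Python floats are used, so results may differ from the listing in the last digits.

```python
import math


def classic_tf(freq):
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency as it appears in the explain output."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq, doc_freq, max_docs, field_norm):
    """fieldWeight = tf * idf * fieldNorm."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from the listing: author term "sommer", docFreq=7, maxDocs=43254.
print(classic_idf(7, 43254))               # compare with 9.595404
print(field_weight(1.0, 7, 43254, 0.625))  # compare with 5.9971275
print(field_weight(1.0, 7, 43254, 0.375))  # compare with 3.5982764
```

The only factor that differs between the first four hits and the fifth is fieldNorm (0.625 versus 0.375), which is lower for the record with three authors in the field.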
    

Similar documents (content)

  1. Jacobs, J.-H.; Mengel, T.; Müller, K.: Benefits of the CrissCross project for conceptual interoperability and retrieval (2010) 0.26
    0.2623126 = sum of:
      0.2623126 = product of:
        1.0929692 = sum of:
          0.10376474 = weight(abstract_txt:authority in 39) [ClassicSimilarity], result of:
            0.10376474 = score(doc=39,freq=1.0), product of:
              0.15607424 = queryWeight, product of:
                1.6411074 = boost
                5.3187375 = idf(docFreq=575, maxDocs=43254)
                0.017880747 = queryNorm
              0.6648422 = fieldWeight in 39, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3187375 = idf(docFreq=575, maxDocs=43254)
                0.125 = fieldNorm(doc=39)
          0.118893884 = weight(abstract_txt:file in 39) [ClassicSimilarity], result of:
            0.118893884 = score(doc=39,freq=1.0), product of:
              0.17089829 = queryWeight, product of:
                1.7172766 = boost
                5.5655975 = idf(docFreq=449, maxDocs=43254)
                0.017880747 = queryNorm
              0.6956997 = fieldWeight in 39, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5655975 = idf(docFreq=449, maxDocs=43254)
                0.125 = fieldNorm(doc=39)
          0.061842512 = weight(abstract_txt:subject in 39) [ClassicSimilarity], result of:
            0.061842512 = score(doc=39,freq=1.0), product of:
              0.12652797 = queryWeight, product of:
                1.8097154 = boost
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.017880747 = queryNorm
              0.48876557 = fieldWeight in 39, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.125 = fieldNorm(doc=39)
          0.33592892 = weight(abstract_txt:notations in 39) [ClassicSimilarity], result of:
            0.33592892 = score(doc=39,freq=1.0), product of:
              0.34155682 = queryWeight, product of:
                2.427744 = boost
                7.8681827 = idf(docFreq=44, maxDocs=43254)
                0.017880747 = queryNorm
              0.98352283 = fieldWeight in 39, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8681827 = idf(docFreq=44, maxDocs=43254)
                0.125 = fieldNorm(doc=39)
          0.13220373 = weight(abstract_txt:classification in 39) [ClassicSimilarity], result of:
            0.13220373 = score(doc=39,freq=1.0), product of:
              0.26454583 = queryWeight, product of:
                3.7006881 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.017880747 = queryNorm
              0.49973848 = fieldWeight in 39, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.125 = fieldNorm(doc=39)
          0.34033537 = weight(abstract_txt:german in 39) [ClassicSimilarity], result of:
            0.34033537 = score(doc=39,freq=1.0), product of:
              0.43408963 = queryWeight, product of:
                3.8705804 = boost
                6.2721677 = idf(docFreq=221, maxDocs=43254)
                0.017880747 = queryNorm
              0.78402096 = fieldWeight in 39, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2721677 = idf(docFreq=221, maxDocs=43254)
                0.125 = fieldNorm(doc=39)
        0.24 = coord(6/25)
    
  2. Jahns, Y.: 20 years SWD : German subject authority data prepared for the future (2011) 0.24
    0.24405377 = sum of:
      0.24405377 = product of:
        0.8716206 = sum of:
          0.014470068 = weight(abstract_txt:with in 3267) [ClassicSimilarity], result of:
            0.014470068 = score(doc=3267,freq=2.0), product of:
              0.052164953 = queryWeight, product of:
                1.1620008 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.017880747 = queryNorm
              0.2773906 = fieldWeight in 3267, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.078125 = fieldNorm(doc=3267)
          0.09171594 = weight(abstract_txt:authority in 3267) [ClassicSimilarity], result of:
            0.09171594 = score(doc=3267,freq=2.0), product of:
              0.15607424 = queryWeight, product of:
                1.6411074 = boost
                5.3187375 = idf(docFreq=575, maxDocs=43254)
                0.017880747 = queryNorm
              0.587643 = fieldWeight in 3267, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3187375 = idf(docFreq=575, maxDocs=43254)
                0.078125 = fieldNorm(doc=3267)
          0.10508834 = weight(abstract_txt:file in 3267) [ClassicSimilarity], result of:
            0.10508834 = score(doc=3267,freq=2.0), product of:
              0.17089829 = queryWeight, product of:
                1.7172766 = boost
                5.5655975 = idf(docFreq=449, maxDocs=43254)
                0.017880747 = queryNorm
              0.61491746 = fieldWeight in 3267, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5655975 = idf(docFreq=449, maxDocs=43254)
                0.078125 = fieldNorm(doc=3267)
          0.066946484 = weight(abstract_txt:subject in 3267) [ClassicSimilarity], result of:
            0.066946484 = score(doc=3267,freq=3.0), product of:
              0.12652797 = queryWeight, product of:
                1.8097154 = boost
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.017880747 = queryNorm
              0.52910423 = fieldWeight in 3267, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.078125 = fieldNorm(doc=3267)
          0.20995557 = weight(abstract_txt:notations in 3267) [ClassicSimilarity], result of:
            0.20995557 = score(doc=3267,freq=1.0), product of:
              0.34155682 = queryWeight, product of:
                2.427744 = boost
                7.8681827 = idf(docFreq=44, maxDocs=43254)
                0.017880747 = queryNorm
              0.61470175 = fieldWeight in 3267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8681827 = idf(docFreq=44, maxDocs=43254)
                0.078125 = fieldNorm(doc=3267)
          0.08262733 = weight(abstract_txt:classification in 3267) [ClassicSimilarity], result of:
            0.08262733 = score(doc=3267,freq=1.0), product of:
              0.26454583 = queryWeight, product of:
                3.7006881 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.017880747 = queryNorm
              0.31233656 = fieldWeight in 3267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.078125 = fieldNorm(doc=3267)
          0.30081683 = weight(abstract_txt:german in 3267) [ClassicSimilarity], result of:
            0.30081683 = score(doc=3267,freq=2.0), product of:
              0.43408963 = queryWeight, product of:
                3.8705804 = boost
                6.2721677 = idf(docFreq=221, maxDocs=43254)
                0.017880747 = queryNorm
              0.6929832 = fieldWeight in 3267, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2721677 = idf(docFreq=221, maxDocs=43254)
                0.078125 = fieldNorm(doc=3267)
        0.28 = coord(7/25)
    
  3. Jacobs, J.-H.; Mengel, T.; Müller, K.: Insights and Outlooks : a retrospective view on the CrissCross project (2011) 0.23
    0.22952351 = sum of:
      0.22952351 = product of:
        0.956348 = sum of:
          0.090794146 = weight(abstract_txt:authority in 1250) [ClassicSimilarity], result of:
            0.090794146 = score(doc=1250,freq=1.0), product of:
              0.15607424 = queryWeight, product of:
                1.6411074 = boost
                5.3187375 = idf(docFreq=575, maxDocs=43254)
                0.017880747 = queryNorm
              0.5817369 = fieldWeight in 1250, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3187375 = idf(docFreq=575, maxDocs=43254)
                0.109375 = fieldNorm(doc=1250)
          0.10403215 = weight(abstract_txt:file in 1250) [ClassicSimilarity], result of:
            0.10403215 = score(doc=1250,freq=1.0), product of:
              0.17089829 = queryWeight, product of:
                1.7172766 = boost
                5.5655975 = idf(docFreq=449, maxDocs=43254)
                0.017880747 = queryNorm
              0.60873723 = fieldWeight in 1250, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5655975 = idf(docFreq=449, maxDocs=43254)
                0.109375 = fieldNorm(doc=1250)
          0.0541122 = weight(abstract_txt:subject in 1250) [ClassicSimilarity], result of:
            0.0541122 = score(doc=1250,freq=1.0), product of:
              0.12652797 = queryWeight, product of:
                1.8097154 = boost
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.017880747 = queryNorm
              0.42766988 = fieldWeight in 1250, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.109375 = fieldNorm(doc=1250)
          0.2939378 = weight(abstract_txt:notations in 1250) [ClassicSimilarity], result of:
            0.2939378 = score(doc=1250,freq=1.0), product of:
              0.34155682 = queryWeight, product of:
                2.427744 = boost
                7.8681827 = idf(docFreq=44, maxDocs=43254)
                0.017880747 = queryNorm
              0.8605825 = fieldWeight in 1250, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8681827 = idf(docFreq=44, maxDocs=43254)
                0.109375 = fieldNorm(doc=1250)
          0.115678266 = weight(abstract_txt:classification in 1250) [ClassicSimilarity], result of:
            0.115678266 = score(doc=1250,freq=1.0), product of:
              0.26454583 = queryWeight, product of:
                3.7006881 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.017880747 = queryNorm
              0.43727118 = fieldWeight in 1250, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.109375 = fieldNorm(doc=1250)
          0.29779345 = weight(abstract_txt:german in 1250) [ClassicSimilarity], result of:
            0.29779345 = score(doc=1250,freq=1.0), product of:
              0.43408963 = queryWeight, product of:
                3.8705804 = boost
                6.2721677 = idf(docFreq=221, maxDocs=43254)
                0.017880747 = queryNorm
              0.68601835 = fieldWeight in 1250, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2721677 = idf(docFreq=221, maxDocs=43254)
                0.109375 = fieldNorm(doc=1250)
        0.24 = coord(6/25)
    
  4. Rolland-Thomas, P.; Mercure, G.: Subject access in a bilingual online catalogue (1989) 0.19
    0.18819292 = sum of:
      0.18819292 = product of:
        0.5881029 = sum of:
          0.014470068 = weight(abstract_txt:with in 1576) [ClassicSimilarity], result of:
            0.014470068 = score(doc=1576,freq=2.0), product of:
              0.052164953 = queryWeight, product of:
                1.1620008 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.017880747 = queryNorm
              0.2773906 = fieldWeight in 1576, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.078125 = fieldNorm(doc=1576)
          0.13630545 = weight(abstract_txt:reciprocal in 1576) [ClassicSimilarity], result of:
            0.13630545 = score(doc=1576,freq=1.0), product of:
              0.20325606 = queryWeight, product of:
                1.3242749 = boost
                8.583802 = idf(docFreq=21, maxDocs=43254)
                0.017880747 = queryNorm
              0.67060953 = fieldWeight in 1576, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.583802 = idf(docFreq=21, maxDocs=43254)
                0.078125 = fieldNorm(doc=1576)
          0.06449532 = weight(abstract_txt:records in 1576) [ClassicSimilarity], result of:
            0.06449532 = score(doc=1576,freq=3.0), product of:
              0.10781762 = queryWeight, product of:
                1.3640059 = boost
                4.420667 = idf(docFreq=1413, maxDocs=43254)
                0.017880747 = queryNorm
              0.59818906 = fieldWeight in 1576, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.420667 = idf(docFreq=1413, maxDocs=43254)
                0.078125 = fieldNorm(doc=1576)
          0.08552846 = weight(abstract_txt:automatic in 1576) [ClassicSimilarity], result of:
            0.08552846 = score(doc=1576,freq=2.0), product of:
              0.14897332 = queryWeight, product of:
                1.60334 = boost
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.017880747 = queryNorm
              0.5741193 = fieldWeight in 1576, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.078125 = fieldNorm(doc=1576)
          0.09171594 = weight(abstract_txt:authority in 1576) [ClassicSimilarity], result of:
            0.09171594 = score(doc=1576,freq=2.0), product of:
              0.15607424 = queryWeight, product of:
                1.6411074 = boost
                5.3187375 = idf(docFreq=575, maxDocs=43254)
                0.017880747 = queryNorm
              0.587643 = fieldWeight in 1576, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3187375 = idf(docFreq=575, maxDocs=43254)
                0.078125 = fieldNorm(doc=1576)
          0.07430868 = weight(abstract_txt:file in 1576) [ClassicSimilarity], result of:
            0.07430868 = score(doc=1576,freq=1.0), product of:
              0.17089829 = queryWeight, product of:
                1.7172766 = boost
                5.5655975 = idf(docFreq=449, maxDocs=43254)
                0.017880747 = queryNorm
              0.4348123 = fieldWeight in 1576, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5655975 = idf(docFreq=449, maxDocs=43254)
                0.078125 = fieldNorm(doc=1576)
          0.03865157 = weight(abstract_txt:subject in 1576) [ClassicSimilarity], result of:
            0.03865157 = score(doc=1576,freq=1.0), product of:
              0.12652797 = queryWeight, product of:
                1.8097154 = boost
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.017880747 = queryNorm
              0.30547848 = fieldWeight in 1576, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.078125 = fieldNorm(doc=1576)
          0.08262733 = weight(abstract_txt:classification in 1576) [ClassicSimilarity], result of:
            0.08262733 = score(doc=1576,freq=1.0), product of:
              0.26454583 = queryWeight, product of:
                3.7006881 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.017880747 = queryNorm
              0.31233656 = fieldWeight in 1576, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.078125 = fieldNorm(doc=1576)
        0.32 = coord(8/25)
    
  5. Reiner, U.: DDC-based search in the data of the German National Bibliography (2008) 0.19
    0.18658048 = sum of:
      0.18658048 = product of:
        0.7774187 = sum of:
          0.010231884 = weight(abstract_txt:with in 4167) [ClassicSimilarity], result of:
            0.010231884 = score(doc=4167,freq=1.0), product of:
              0.052164953 = queryWeight, product of:
                1.1620008 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.017880747 = queryNorm
              0.19614479 = fieldWeight in 4167, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.078125 = fieldNorm(doc=4167)
          0.074472785 = weight(abstract_txt:records in 4167) [ClassicSimilarity], result of:
            0.074472785 = score(doc=4167,freq=4.0), product of:
              0.10781762 = queryWeight, product of:
                1.3640059 = boost
                4.420667 = idf(docFreq=1413, maxDocs=43254)
                0.017880747 = queryNorm
              0.69072926 = fieldWeight in 4167, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.420667 = idf(docFreq=1413, maxDocs=43254)
                0.078125 = fieldNorm(doc=4167)
          0.08552846 = weight(abstract_txt:automatic in 4167) [ClassicSimilarity], result of:
            0.08552846 = score(doc=4167,freq=2.0), product of:
              0.14897332 = queryWeight, product of:
                1.60334 = boost
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.017880747 = queryNorm
              0.5741193 = fieldWeight in 4167, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.078125 = fieldNorm(doc=4167)
          0.03865157 = weight(abstract_txt:subject in 4167) [ClassicSimilarity], result of:
            0.03865157 = score(doc=4167,freq=1.0), product of:
              0.12652797 = queryWeight, product of:
                1.8097154 = boost
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.017880747 = queryNorm
              0.30547848 = fieldWeight in 4167, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.078125 = fieldNorm(doc=4167)
          0.14311475 = weight(abstract_txt:classification in 4167) [ClassicSimilarity], result of:
            0.14311475 = score(doc=4167,freq=3.0), product of:
              0.26454583 = queryWeight, product of:
                3.7006881 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.017880747 = queryNorm
              0.5409828 = fieldWeight in 4167, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.078125 = fieldNorm(doc=4167)
          0.4254192 = weight(abstract_txt:german in 4167) [ClassicSimilarity], result of:
            0.4254192 = score(doc=4167,freq=4.0), product of:
              0.43408963 = queryWeight, product of:
                3.8705804 = boost
                6.2721677 = idf(docFreq=221, maxDocs=43254)
                0.017880747 = queryNorm
              0.9800262 = fieldWeight in 4167, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2721677 = idf(docFreq=221, maxDocs=43254)
                0.078125 = fieldNorm(doc=4167)
        0.24 = coord(6/25)
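
For the content-based scores, each term contributes queryWeight * fieldWeight, and the sum is scaled by the coord factor (matched query terms over total query terms). The following sketch recomputes the score of the first hit (doc 39) from the boosts, document frequencies, queryNorm and fieldNorm printed above. The function and constant names are assumptions, and the computation uses plain Python floats, so small rounding differences from the listing are expected.

```python
import math

MAX_DOCS = 43254          # maxDocs from the listing
QUERY_NORM = 0.017880747  # queryNorm from the listing


def idf(doc_freq, max_docs=MAX_DOCS):
    """idf = 1 + ln(maxDocs / (docFreq + 1)), consistent with the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq, field_norm, query_norm=QUERY_NORM):
    """Per-term score = queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm, matched, total_clauses):
    """Sum of the per-term scores, scaled by coord = matched / total_clauses."""
    coord = matched / total_clauses
    return coord * sum(term_score(b, df, f, field_norm) for b, df, f in terms)


# (boost, docFreq, freq) for the six query terms matched in doc 39.
DOC_39_TERMS = [
    (1.6411074, 575, 1.0),   # authority
    (1.7172766, 449, 1.0),   # file
    (1.8097154, 2355, 1.0),  # subject
    (2.427744, 44, 1.0),     # notations
    (3.7006881, 2157, 1.0),  # classification
    (3.8705804, 221, 1.0),   # german
]

score = document_score(DOC_39_TERMS, field_norm=0.125, matched=6, total_clauses=25)
print(score)  # compare with 0.2623126 in the listing
```

The coord factor explains why the fourth hit, which matches 8 of 25 terms, can still rank below hits matching only 6: its matched terms carry lower boosts and idf values.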