Document (#30189)

Author
Kim, W.
Wilbur, W.J.
Title
Corpus-based statistical screening for content-bearing terms
Source
Journal of the American Society for Information Science and Technology. 52(2001) no.3, p.247-259
Year
2001
Abstract
Kim and Wilbur present three techniques for the algorithmic identification in text of content-bearing terms and phrases intended for human use as entry points or hyperlinks. Using a set of 1,075 terms from MEDLINE, each rated on a scale from zero (stop word) to four (definite content word), they evaluate the ranked lists produced by their three methods according to how well they place content words in the top ranks. The data consist of the natural language elements of 304,057 MEDLINE records from 1996 and 173,252 Wall Street Journal records from the TIPSTER collection. Phrases are extracted by breaking at punctuation marks and stop words, then normalized by lowercasing, replacing nonalphanumeric characters with spaces, and collapsing multiple spaces. In the "strength of context" approach, each document is a vector of binary values for each word or word pair. The words or word pairs are removed from all documents, and the Robertson-Sparck Jones relevance weight is computed for each term; negative weights are replaced with zero, those below a randomness threshold are ignored, and the remainder are summed for each document to yield a document score. The term is then assigned the average document score of the documents in which it occurred, and the average of these word scores is assigned to the original phrase. The "frequency clumping" approach defines a random phrase as one whose distribution among documents is Poisson in character. A p-value, the probability that a phrase's frequency of occurrence would be equal to or less than Poisson expectations, is computed, and the phrase is assigned a score equal to the negative log of that value. In the "database comparison" approach, if a phrase occurring in a document allows prediction that the document is in MEDLINE rather than in the Wall Street Journal, it is considered to be content bearing for MEDLINE. The score is computed by dividing the number of occurrences of the term in MEDLINE by occurrences in the Journal, and taking the product of all these values.
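The phrase extraction and normalization steps described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the stop list, the set of punctuation marks, and the function names `normalize` and `extract_phrases` are assumptions made for the example.

```python
import re

# Hypothetical stop list for illustration only; the abstract does not give the authors' list.
STOP_WORDS = {"a", "an", "and", "in", "is", "of", "the", "to", "was", "were", "with"}

def normalize(phrase: str) -> str:
    """Lowercase, replace non-alphanumerics with spaces, collapse repeated spaces."""
    lowered = phrase.lower()
    spaced = re.sub(r"[^a-z0-9]", " ", lowered)  # ASCII-only, a simplifying assumption
    return re.sub(r" +", " ", spaced).strip()

def extract_phrases(text: str) -> list[str]:
    """Break text at punctuation marks and stop words, then normalize each piece."""
    phrases = []
    # Break at punctuation first (the exact set of marks is assumed here).
    for segment in re.split(r"[.,;:!?()\[\]\"]", text):
        current = []
        for token in segment.split():
            if token.lower() in STOP_WORDS:
                # A stop word closes the phrase collected so far.
                if current:
                    phrases.append(normalize(" ".join(current)))
                current = []
            else:
                current.append(token)
        if current:
            phrases.append(normalize(" ".join(current)))
    # Drop pieces that normalize to nothing (e.g. a lone hyphen).
    return [p for p in phrases if p]

print(extract_phrases("Expression of the Beta-Globin gene was reduced, in T cells."))
```

Splitting on punctuation before stop words keeps a phrase from spanning a clause boundary, which matches the order in which the abstract lists the two break conditions.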
The one hundred top- and bottom-ranked phrases that occurred in at least 500 documents were collected for each method. The union set had 476 phrases. A second selection was made of two-word phrases, each occurring in only three documents, with a union of 599 phrases. A judge then rated the two sets of terms for subject specificity on a 0 to 4 scale. Precision was the average subject specificity of the first r ranks, and recall was the fraction of the subject-specific phrases in the first r ranks; eleven-point average precision was used as a summary measure. All three methods move content-bearing terms forward in the lists, as does the use of the sum of the logs of the three methods.
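The evaluation measures can be illustrated with a short sketch. It rests on assumptions the abstract does not settle: recall is taken here as the share of the total specificity grade found in the first r ranks, and the eleven-point figure uses the standard interpolation (the best precision at any rank whose recall reaches each level 0.0, 0.1, ..., 1.0). The function names are hypothetical.

```python
def precision_recall_points(grades):
    """grades: judged specificity (0-4) of phrases in ranked order, best rank first.

    Returns one (recall, precision) pair per rank r.
    """
    total = sum(grades)
    points = []
    running = 0
    for r, grade in enumerate(grades, start=1):
        running += grade
        precision = running / r                      # average grade of the first r ranks
        recall = running / total if total else 0.0   # assumed: share of total grade so far
        points.append((recall, precision))
    return points

def eleven_point_average_precision(grades):
    """Mean of interpolated precision at recall levels 0.0, 0.1, ..., 1.0."""
    points = precision_recall_points(grades)
    interpolated = []
    for i in range(11):
        level = i / 10
        candidates = [p for rec, p in points if rec >= level]
        interpolated.append(max(candidates) if candidates else 0.0)
    return sum(interpolated) / 11

# A list that puts the specific phrases first scores higher than the reverse ordering.
good = eleven_point_average_precision([4, 4, 0, 0])
bad = eleven_point_average_precision([0, 0, 4, 4])
print(good, bad)
```

Because precision is an average grade rather than a fraction, the summary value lies on the same 0 to 4 scale as the judgments.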
Theme
Computational linguistics
Object
Medline

Similar documents (author)

  1. Wilbur, W.J.: Global term weights for document retrieval learned from TREC data (2001) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:wilbur in 2647) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 2647, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=2647)
    
  2. Wilbur, W.J.: Human subjectivity and performance limits in document retrieval (1996) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:wilbur in 6607) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 6607, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=6607)
    
  3. Wilbur, W.J.: A comparison of group and individual performance among subject experts and untrained workers at the document retrieval task (1998) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:wilbur in 3263) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 3263, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=3263)
    
  4. Wilbur, W.J.: Human subjectivity and performance limits in document retrieval (1999) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:wilbur in 4539) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 4539, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=4539)
    
  5. Wilbur, W.J.: A retrieval system based on automatic relevance weighting of search terms (1992) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:wilbur in 5269) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 5269, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=5269)
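The author-match scores above can be reproduced from the three factors shown in each listing. The sketch below assumes the classic Lucene TF-IDF formulas (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))), which agree with the printed values; the function names are illustrative and are not Lucene API calls.

```python
import math

def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, consistent with the listing: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, the product shown in each explanation."""
    return tf(freq) * idf(doc_freq, max_docs) * field_norm

# author_txt:wilbur occurs once per record, in 14 of 44218 documents, with fieldNorm 0.625.
print(field_weight(1.0, 14, 44218, 0.625))  # about 5.618, as in the listing
```

All five records get the same score because the term frequency, document frequency, and field norm are identical in each case.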
    

Similar documents (content)

  1. Fagan, J.L.: The effectiveness of a nonsyntactic approach to automatic phrase indexing for document retrieval (1989) 0.20
    0.20422691 = sum of:
      0.20422691 = product of:
        0.8509455 = sum of:
          0.036114644 = weight(abstract_txt:words in 1845) [ClassicSimilarity], result of:
            0.036114644 = score(doc=1845,freq=1.0), product of:
              0.10794575 = queryWeight, product of:
                1.0780194 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01870601 = queryNorm
              0.33456293 = fieldWeight in 1845, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=1845)
          0.059563663 = weight(abstract_txt:content in 1845) [ClassicSimilarity], result of:
            0.059563663 = score(doc=1845,freq=3.0), product of:
              0.13163574 = queryWeight, product of:
                1.6835487 = boost
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.01870601 = queryNorm
              0.45248854 = fieldWeight in 1845, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.0625 = fieldNorm(doc=1845)
          0.08328537 = weight(abstract_txt:document in 1845) [ClassicSimilarity], result of:
            0.08328537 = score(doc=1845,freq=5.0), product of:
              0.13882971 = queryWeight, product of:
                1.7289402 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.01870601 = queryNorm
              0.59991026 = fieldWeight in 1845, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=1845)
          0.2754252 = weight(abstract_txt:phrase in 1845) [ClassicSimilarity], result of:
            0.2754252 = score(doc=1845,freq=6.0), product of:
              0.25332704 = queryWeight, product of:
                1.9069264 = boost
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.01870601 = queryNorm
              1.0872318 = fieldWeight in 1845, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.0625 = fieldNorm(doc=1845)
          0.08821862 = weight(abstract_txt:word in 1845) [ClassicSimilarity], result of:
            0.08821862 = score(doc=1845,freq=1.0), product of:
              0.25968632 = queryWeight, product of:
                2.5540931 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.01870601 = queryNorm
              0.33971223 = fieldWeight in 1845, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=1845)
          0.30833796 = weight(abstract_txt:phrases in 1845) [ClassicSimilarity], result of:
            0.30833796 = score(doc=1845,freq=3.0), product of:
              0.4146864 = queryWeight, product of:
                3.2275434 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.01870601 = queryNorm
              0.7435449 = fieldWeight in 1845, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.0625 = fieldNorm(doc=1845)
        0.24 = coord(6/25)
    
  2. Beatty, S.: Subject enrichment using contents or index terms : the Australian Defence Force Academy experience (1992) 0.20
    0.1974327 = sum of:
      0.1974327 = product of:
        0.70511675 = sum of:
          0.045143306 = weight(abstract_txt:words in 4649) [ClassicSimilarity], result of:
            0.045143306 = score(doc=4649,freq=1.0), product of:
              0.10794575 = queryWeight, product of:
                1.0780194 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01870601 = queryNorm
              0.41820365 = fieldWeight in 4649, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.078125 = fieldNorm(doc=4649)
          0.06487378 = weight(abstract_txt:terms in 4649) [ClassicSimilarity], result of:
            0.06487378 = score(doc=4649,freq=4.0), product of:
              0.10267207 = queryWeight, product of:
                1.3572953 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01870601 = queryNorm
              0.6318542 = fieldWeight in 4649, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=4649)
          0.08100259 = weight(abstract_txt:average in 4649) [ClassicSimilarity], result of:
            0.08100259 = score(doc=4649,freq=1.0), product of:
              0.17543739 = queryWeight, product of:
                1.586917 = boost
                5.90999 = idf(docFreq=325, maxDocs=44218)
                0.01870601 = queryNorm
              0.46171796 = fieldWeight in 4649, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.90999 = idf(docFreq=325, maxDocs=44218)
                0.078125 = fieldNorm(doc=4649)
          0.041126687 = weight(abstract_txt:each in 4649) [ClassicSimilarity], result of:
            0.041126687 = score(doc=4649,freq=1.0), product of:
              0.12781125 = queryWeight, product of:
                1.658912 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.01870601 = queryNorm
              0.32177672 = fieldWeight in 4649, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.078125 = fieldNorm(doc=4649)
          0.042986374 = weight(abstract_txt:content in 4649) [ClassicSimilarity], result of:
            0.042986374 = score(doc=4649,freq=1.0), product of:
              0.13163574 = queryWeight, product of:
                1.6835487 = boost
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.01870601 = queryNorm
              0.3265555 = fieldWeight in 4649, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.078125 = fieldNorm(doc=4649)
          0.20746024 = weight(abstract_txt:bearing in 4649) [ClassicSimilarity], result of:
            0.20746024 = score(doc=4649,freq=1.0), product of:
              0.3284073 = queryWeight, product of:
                2.1711986 = boost
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.01870601 = queryNorm
              0.6317163 = fieldWeight in 4649, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.078125 = fieldNorm(doc=4649)
          0.22252376 = weight(abstract_txt:phrases in 4649) [ClassicSimilarity], result of:
            0.22252376 = score(doc=4649,freq=1.0), product of:
              0.4146864 = queryWeight, product of:
                3.2275434 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.01870601 = queryNorm
              0.5366073 = fieldWeight in 4649, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.078125 = fieldNorm(doc=4649)
        0.28 = coord(7/25)
    
  3. Wordhoard (n.d.) 0.19
    0.1940908 = sum of:
      0.1940908 = product of:
        0.970454 = sum of:
          0.063842274 = weight(abstract_txt:words in 3922) [ClassicSimilarity], result of:
            0.063842274 = score(doc=3922,freq=2.0), product of:
              0.10794575 = queryWeight, product of:
                1.0780194 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01870601 = queryNorm
              0.5914293 = fieldWeight in 3922, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.078125 = fieldNorm(doc=3922)
          0.042246625 = weight(abstract_txt:three in 3922) [ClassicSimilarity], result of:
            0.042246625 = score(doc=3922,freq=1.0), product of:
              0.122448705 = queryWeight, product of:
                1.482263 = boost
                4.41619 = idf(docFreq=1451, maxDocs=44218)
                0.01870601 = queryNorm
              0.34501487 = fieldWeight in 3922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.41619 = idf(docFreq=1451, maxDocs=44218)
                0.078125 = fieldNorm(doc=3922)
          0.198771 = weight(abstract_txt:phrase in 3922) [ClassicSimilarity], result of:
            0.198771 = score(doc=3922,freq=2.0), product of:
              0.25332704 = queryWeight, product of:
                1.9069264 = boost
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.01870601 = queryNorm
              0.78464186 = fieldWeight in 3922, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.078125 = fieldNorm(doc=3922)
          0.22054656 = weight(abstract_txt:word in 3922) [ClassicSimilarity], result of:
            0.22054656 = score(doc=3922,freq=4.0), product of:
              0.25968632 = queryWeight, product of:
                2.5540931 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.01870601 = queryNorm
              0.8492806 = fieldWeight in 3922, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.078125 = fieldNorm(doc=3922)
          0.44504753 = weight(abstract_txt:phrases in 3922) [ClassicSimilarity], result of:
            0.44504753 = score(doc=3922,freq=4.0), product of:
              0.4146864 = queryWeight, product of:
                3.2275434 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.01870601 = queryNorm
              1.0732147 = fieldWeight in 3922, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.078125 = fieldNorm(doc=3922)
        0.2 = coord(5/25)
    
  4. WordHoard: finding multiword units (20??) 0.19
    0.1940908 = sum of:
      0.1940908 = product of:
        0.970454 = sum of:
          0.063842274 = weight(abstract_txt:words in 1123) [ClassicSimilarity], result of:
            0.063842274 = score(doc=1123,freq=2.0), product of:
              0.10794575 = queryWeight, product of:
                1.0780194 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01870601 = queryNorm
              0.5914293 = fieldWeight in 1123, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.078125 = fieldNorm(doc=1123)
          0.042246625 = weight(abstract_txt:three in 1123) [ClassicSimilarity], result of:
            0.042246625 = score(doc=1123,freq=1.0), product of:
              0.122448705 = queryWeight, product of:
                1.482263 = boost
                4.41619 = idf(docFreq=1451, maxDocs=44218)
                0.01870601 = queryNorm
              0.34501487 = fieldWeight in 1123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.41619 = idf(docFreq=1451, maxDocs=44218)
                0.078125 = fieldNorm(doc=1123)
          0.198771 = weight(abstract_txt:phrase in 1123) [ClassicSimilarity], result of:
            0.198771 = score(doc=1123,freq=2.0), product of:
              0.25332704 = queryWeight, product of:
                1.9069264 = boost
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.01870601 = queryNorm
              0.78464186 = fieldWeight in 1123, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.078125 = fieldNorm(doc=1123)
          0.22054656 = weight(abstract_txt:word in 1123) [ClassicSimilarity], result of:
            0.22054656 = score(doc=1123,freq=4.0), product of:
              0.25968632 = queryWeight, product of:
                2.5540931 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.01870601 = queryNorm
              0.8492806 = fieldWeight in 1123, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.078125 = fieldNorm(doc=1123)
          0.44504753 = weight(abstract_txt:phrases in 1123) [ClassicSimilarity], result of:
            0.44504753 = score(doc=1123,freq=4.0), product of:
              0.4146864 = queryWeight, product of:
                3.2275434 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.01870601 = queryNorm
              1.0732147 = fieldWeight in 1123, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.078125 = fieldNorm(doc=1123)
        0.2 = coord(5/25)
    
  5. Ruthven, I.; Lalmas, M.; Rijsbergen, K. van: Combining and selecting characteristics of information use (2002) 0.19
    0.19321418 = sum of:
      0.19321418 = product of:
        0.53670603 = sum of:
          0.06879589 = weight(abstract_txt:specificity in 5208) [ClassicSimilarity], result of:
            0.06879589 = score(doc=5208,freq=2.0), product of:
              0.1393297 = queryWeight, product of:
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.01870601 = queryNorm
              0.4937633 = fieldWeight in 5208, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.046875 = fieldNorm(doc=5208)
          0.027085984 = weight(abstract_txt:words in 5208) [ClassicSimilarity], result of:
            0.027085984 = score(doc=5208,freq=1.0), product of:
              0.10794575 = queryWeight, product of:
                1.0780194 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01870601 = queryNorm
              0.2509222 = fieldWeight in 5208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.046875 = fieldNorm(doc=5208)
          0.06620031 = weight(abstract_txt:ranked in 5208) [ClassicSimilarity], result of:
            0.06620031 = score(doc=5208,freq=2.0), product of:
              0.1554554 = queryWeight, product of:
                1.2936795 = boost
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.01870601 = queryNorm
              0.42584762 = fieldWeight in 5208, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.046875 = fieldNorm(doc=5208)
          0.038924262 = weight(abstract_txt:terms in 5208) [ClassicSimilarity], result of:
            0.038924262 = score(doc=5208,freq=4.0), product of:
              0.10267207 = queryWeight, product of:
                1.3572953 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01870601 = queryNorm
              0.37911248 = fieldWeight in 5208, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.046875 = fieldNorm(doc=5208)
          0.04606685 = weight(abstract_txt:documents in 5208) [ClassicSimilarity], result of:
            0.04606685 = score(doc=5208,freq=5.0), product of:
              0.10664186 = queryWeight, product of:
                1.3832861 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01870601 = queryNorm
              0.43197718 = fieldWeight in 5208, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.046875 = fieldNorm(doc=5208)
          0.07708892 = weight(abstract_txt:ranks in 5208) [ClassicSimilarity], result of:
            0.07708892 = score(doc=5208,freq=1.0), product of:
              0.21678893 = queryWeight, product of:
                1.527715 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.01870601 = queryNorm
              0.35559437 = fieldWeight in 5208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.046875 = fieldNorm(doc=5208)
          0.08418036 = weight(abstract_txt:average in 5208) [ClassicSimilarity], result of:
            0.08418036 = score(doc=5208,freq=3.0), product of:
              0.17543739 = queryWeight, product of:
                1.586917 = boost
                5.90999 = idf(docFreq=325, maxDocs=44218)
                0.01870601 = queryNorm
              0.47983137 = fieldWeight in 5208, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.90999 = idf(docFreq=325, maxDocs=44218)
                0.046875 = fieldNorm(doc=5208)
          0.049352024 = weight(abstract_txt:each in 5208) [ClassicSimilarity], result of:
            0.049352024 = score(doc=5208,freq=4.0), product of:
              0.12781125 = queryWeight, product of:
                1.658912 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.01870601 = queryNorm
              0.38613206 = fieldWeight in 5208, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.046875 = fieldNorm(doc=5208)
          0.07901143 = weight(abstract_txt:document in 5208) [ClassicSimilarity], result of:
            0.07901143 = score(doc=5208,freq=8.0), product of:
              0.13882971 = queryWeight, product of:
                1.7289402 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.01870601 = queryNorm
              0.5691248 = fieldWeight in 5208, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.046875 = fieldNorm(doc=5208)
        0.36 = coord(9/25)
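The content-similarity scores combine a per-term query weight, a per-term field weight, and a coordination factor. As a rough check, the sketch below recomputes the score of the Wordhoard record (doc 3922) from the numbers printed in its listing. The boost values and queryNorm are copied from the listing rather than derived, the formulas are the classic Lucene TF-IDF ones assumed to be in use here, and all names are illustrative.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.01870601  # copied from the listing, not derived here

def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    """score = queryWeight * fieldWeight for one matching term."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight

def document_score(terms, field_norm: float, matched: int, query_terms: int) -> float:
    """Sum the term scores, then scale by coord = matched / query_terms."""
    coord = matched / query_terms
    return coord * sum(term_score(f, df, b, field_norm) for f, df, b in terms)

# (termFreq, docFreq, boost) for the five matching terms of doc 3922.
WORDHOARD_TERMS = [
    (2.0, 568, 1.0780194),   # words
    (1.0, 1451, 1.482263),   # three
    (2.0, 98, 1.9069264),    # phrase
    (4.0, 523, 2.5540931),   # word
    (4.0, 124, 3.2275434),   # phrases
]

score = document_score(WORDHOARD_TERMS, field_norm=0.078125, matched=5, query_terms=25)
print(score)  # close to the 0.1940908 reported for this record
```

The coordination factor explains why a record matching only 5 of the 25 query terms scores low even when individual terms such as "phrases" match strongly.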