Document (#41038)

Author
Aldebei, K.
He, X.
Jia, W.
Yeh, W.
Title
SUDMAD: Sequential and unsupervised decomposition of a multi-author document based on a hidden markov model
Source
Journal of the Association for Information Science and Technology. 69(2018) no.2, S.201-214
Year
2018
Abstract
Decomposing a document written by more than one author into sentences based on authorship is of great significance due to the increasing demand for plagiarism detection, forensic analysis, civil law (i.e., disputed copyright issues), and intelligence issues that involve disputed anonymous documents. Among existing studies for document decomposition, some were limited by specific languages, according to topics or restricted to a document of two authors, and their accuracies have big room for improvement. In this paper, we consider the contextual correlation hidden among sentences and propose an algorithm for Sequential and Unsupervised Decomposition of a Multi-Author Document (SUDMAD) written in any language, disregarding topics, through the construction of a Hidden Markov Model (HMM) reflecting the authors' writing styles. To build and learn such a model, an unsupervised, statistical approach is first proposed to estimate the initial values of HMM parameters of a preliminary model, which does not require the availability of any information of author's or document's context other than how many authors contributed to writing the document. To further boost the performance of this approach, a boosted HMM learning procedure is proposed next, where the initial classification results are used to create labeled training data to learn a more accurate HMM. Moreover, the contextual relationship among sentences is further utilized to refine the classification results. Our proposed approach is empirically evaluated on three benchmark datasets that are widely used for authorship analysis of documents. Comparisons with recent state-of-the-art approaches are also presented to demonstrate the significance of our new ideas and the superior performance of our approach.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23956/full.

Similar documents (content)

  1. Giannella, C.: ¬An improved algorithm for unsupervised decomposition of a multi-author document (2016) 0.37
    0.37269816 = sum of:
      0.37269816 = product of:
        1.0352726 = sum of:
          0.05673178 = weight(abstract_txt:written in 2642) [ClassicSimilarity], result of:
            0.05673178 = score(doc=2642,freq=1.0), product of:
              0.1256195 = queryWeight, product of:
                1.1856626 = boost
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.018328067 = queryNorm
              0.45161602 = fieldWeight in 2642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.078125 = fieldNorm(doc=2642)
          0.06168703 = weight(abstract_txt:multi in 2642) [ClassicSimilarity], result of:
            0.06168703 = score(doc=2642,freq=1.0), product of:
              0.1328318 = queryWeight, product of:
                1.2192243 = boost
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.018328067 = queryNorm
              0.46439958 = fieldWeight in 2642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.078125 = fieldNorm(doc=2642)
          0.04425054 = weight(abstract_txt:authors in 2642) [ClassicSimilarity], result of:
            0.04425054 = score(doc=2642,freq=1.0), product of:
              0.12184722 = queryWeight, product of:
                1.4301647 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.018328067 = queryNorm
              0.36316413 = fieldWeight in 2642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.078125 = fieldNorm(doc=2642)
          0.094117895 = weight(abstract_txt:author in 2642) [ClassicSimilarity], result of:
            0.094117895 = score(doc=2642,freq=3.0), product of:
              0.13972612 = queryWeight, product of:
                1.5315 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.018328067 = queryNorm
              0.6735884 = fieldWeight in 2642, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.078125 = fieldNorm(doc=2642)
          0.053450122 = weight(abstract_txt:approach in 2642) [ClassicSimilarity], result of:
            0.053450122 = score(doc=2642,freq=3.0), product of:
              0.105464965 = queryWeight, product of:
                1.5363908 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.018328067 = queryNorm
              0.5068045 = fieldWeight in 2642, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.078125 = fieldNorm(doc=2642)
          0.21336345 = weight(abstract_txt:sentences in 2642) [ClassicSimilarity], result of:
            0.21336345 = score(doc=2642,freq=2.0), product of:
              0.2760196 = queryWeight, product of:
                2.152525 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.018328067 = queryNorm
              0.7730011 = fieldWeight in 2642, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.078125 = fieldNorm(doc=2642)
          0.19486485 = weight(abstract_txt:unsupervised in 2642) [ClassicSimilarity], result of:
            0.19486485 = score(doc=2642,freq=1.0), product of:
              0.32735997 = queryWeight, product of:
                2.3441803 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.018328067 = queryNorm
              0.5952617 = fieldWeight in 2642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.078125 = fieldNorm(doc=2642)
          0.21824963 = weight(abstract_txt:decomposition in 2642) [ClassicSimilarity], result of:
            0.21824963 = score(doc=2642,freq=1.0), product of:
              0.3530522 = queryWeight, product of:
                2.4344323 = boost
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.018328067 = queryNorm
              0.6181795 = fieldWeight in 2642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.078125 = fieldNorm(doc=2642)
          0.09855725 = weight(abstract_txt:document in 2642) [ClassicSimilarity], result of:
            0.09855725 = score(doc=2642,freq=2.0), product of:
              0.207808 = queryWeight, product of:
                2.6413403 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.018328067 = queryNorm
              0.4742707 = fieldWeight in 2642, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=2642)
        0.36 = coord(9/25)
    
  2. Kocher, M.; Savoy, J.: ¬A simple and efficient algorithm for authorship verification (2017) 0.24
    0.23580238 = sum of:
      0.23580238 = product of:
        0.84215134 = sum of:
          0.05673178 = weight(abstract_txt:written in 3330) [ClassicSimilarity], result of:
            0.05673178 = score(doc=3330,freq=1.0), product of:
              0.1256195 = queryWeight, product of:
                1.1856626 = boost
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.018328067 = queryNorm
              0.45161602 = fieldWeight in 3330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.078125 = fieldNorm(doc=3330)
          0.043140832 = weight(abstract_txt:proposed in 3330) [ClassicSimilarity], result of:
            0.043140832 = score(doc=3330,freq=1.0), product of:
              0.1198015 = queryWeight, product of:
                1.4181081 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.018328067 = queryNorm
              0.36010262 = fieldWeight in 3330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.078125 = fieldNorm(doc=3330)
          0.09980537 = weight(abstract_txt:authorship in 3330) [ClassicSimilarity], result of:
            0.09980537 = score(doc=3330,freq=1.0), product of:
              0.18306644 = queryWeight, product of:
                1.4313208 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.018328067 = queryNorm
              0.5451866 = fieldWeight in 3330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.078125 = fieldNorm(doc=3330)
          0.054338988 = weight(abstract_txt:author in 3330) [ClassicSimilarity], result of:
            0.054338988 = score(doc=3330,freq=1.0), product of:
              0.13972612 = queryWeight, product of:
                1.5315 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.018328067 = queryNorm
              0.38889644 = fieldWeight in 3330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.078125 = fieldNorm(doc=3330)
          0.037205476 = weight(abstract_txt:model in 3330) [ClassicSimilarity], result of:
            0.037205476 = score(doc=3330,freq=1.0), product of:
              0.11946868 = queryWeight, product of:
                1.6352141 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.018328067 = queryNorm
              0.31142452 = fieldWeight in 3330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.078125 = fieldNorm(doc=3330)
          0.35606402 = weight(abstract_txt:disputed in 3330) [ClassicSimilarity], result of:
            0.35606402 = score(doc=3330,freq=2.0), product of:
              0.3392461 = queryWeight, product of:
                1.9484533 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.018328067 = queryNorm
              1.0495744 = fieldWeight in 3330, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.078125 = fieldNorm(doc=3330)
          0.19486485 = weight(abstract_txt:unsupervised in 3330) [ClassicSimilarity], result of:
            0.19486485 = score(doc=3330,freq=1.0), product of:
              0.32735997 = queryWeight, product of:
                2.3441803 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.018328067 = queryNorm
              0.5952617 = fieldWeight in 3330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.078125 = fieldNorm(doc=3330)
        0.28 = coord(7/25)
    
  3. Koppel, M.; Winter, Y.: Determining if two documents are written by the same author (2014) 0.17
    0.1734614 = sum of:
      0.1734614 = product of:
        0.7227559 = sum of:
          0.079424486 = weight(abstract_txt:written in 1602) [ClassicSimilarity], result of:
            0.079424486 = score(doc=1602,freq=1.0), product of:
              0.1256195 = queryWeight, product of:
                1.1856626 = boost
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.018328067 = queryNorm
              0.6322624 = fieldWeight in 1602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.109375 = fieldNorm(doc=1602)
          0.057151802 = weight(abstract_txt:among in 1602) [ClassicSimilarity], result of:
            0.057151802 = score(doc=1602,freq=1.0), product of:
              0.11547053 = queryWeight, product of:
                1.392239 = boost
                4.5252304 = idf(docFreq=1301, maxDocs=44218)
                0.018328067 = queryNorm
              0.49494708 = fieldWeight in 1602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5252304 = idf(docFreq=1301, maxDocs=44218)
                0.109375 = fieldNorm(doc=1602)
          0.13972752 = weight(abstract_txt:authorship in 1602) [ClassicSimilarity], result of:
            0.13972752 = score(doc=1602,freq=1.0), product of:
              0.18306644 = queryWeight, product of:
                1.4313208 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.018328067 = queryNorm
              0.7632612 = fieldWeight in 1602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.109375 = fieldNorm(doc=1602)
          0.076074585 = weight(abstract_txt:author in 1602) [ClassicSimilarity], result of:
            0.076074585 = score(doc=1602,freq=1.0), product of:
              0.13972612 = queryWeight, product of:
                1.5315 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.018328067 = queryNorm
              0.544455 = fieldWeight in 1602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.109375 = fieldNorm(doc=1602)
          0.2728108 = weight(abstract_txt:unsupervised in 1602) [ClassicSimilarity], result of:
            0.2728108 = score(doc=1602,freq=1.0), product of:
              0.32735997 = queryWeight, product of:
                2.3441803 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.018328067 = queryNorm
              0.8333664 = fieldWeight in 1602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.109375 = fieldNorm(doc=1602)
          0.097566694 = weight(abstract_txt:document in 1602) [ClassicSimilarity], result of:
            0.097566694 = score(doc=1602,freq=1.0), product of:
              0.207808 = queryWeight, product of:
                2.6413403 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.018328067 = queryNorm
              0.46950403 = fieldWeight in 1602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.109375 = fieldNorm(doc=1602)
        0.24 = coord(6/25)
    
  4. Savoy, J.: Estimating the probability of an authorship attribution (2016) 0.16
    0.16427933 = sum of:
      0.16427933 = product of:
        0.5867119 = sum of:
          0.04880828 = weight(abstract_txt:proposed in 2937) [ClassicSimilarity], result of:
            0.04880828 = score(doc=2937,freq=2.0), product of:
              0.1198015 = queryWeight, product of:
                1.4181081 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.018328067 = queryNorm
              0.4074096 = fieldWeight in 2937, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.03540043 = weight(abstract_txt:authors in 2937) [ClassicSimilarity], result of:
            0.03540043 = score(doc=2937,freq=1.0), product of:
              0.12184722 = queryWeight, product of:
                1.4301647 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.018328067 = queryNorm
              0.2905313 = fieldWeight in 2937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.15968859 = weight(abstract_txt:authorship in 2937) [ClassicSimilarity], result of:
            0.15968859 = score(doc=2937,freq=4.0), product of:
              0.18306644 = queryWeight, product of:
                1.4313208 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.018328067 = queryNorm
              0.87229854 = fieldWeight in 2937, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.08694238 = weight(abstract_txt:author in 2937) [ClassicSimilarity], result of:
            0.08694238 = score(doc=2937,freq=4.0), product of:
              0.13972612 = queryWeight, product of:
                1.5315 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.018328067 = queryNorm
              0.6222343 = fieldWeight in 2937, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.024687555 = weight(abstract_txt:approach in 2937) [ClassicSimilarity], result of:
            0.024687555 = score(doc=2937,freq=1.0), product of:
              0.105464965 = queryWeight, product of:
                1.5363908 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.018328067 = queryNorm
              0.234083 = fieldWeight in 2937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.029764382 = weight(abstract_txt:model in 2937) [ClassicSimilarity], result of:
            0.029764382 = score(doc=2937,freq=1.0), product of:
              0.11946868 = queryWeight, product of:
                1.6352141 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.018328067 = queryNorm
              0.24913962 = fieldWeight in 2937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.20142022 = weight(abstract_txt:disputed in 2937) [ClassicSimilarity], result of:
            0.20142022 = score(doc=2937,freq=1.0), product of:
              0.3392461 = queryWeight, product of:
                1.9484533 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.018328067 = queryNorm
              0.5937289 = fieldWeight in 2937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
        0.28 = coord(7/25)
    
  5. Kar, M.; Nunes, S.; Ribeiro, C.: Summarization of changes in dynamic text collections using Latent Dirichlet Allocation model (2015) 0.15
    0.15265608 = sum of:
      0.15265608 = product of:
        0.47705024 = sum of:
          0.08562292 = weight(abstract_txt:disregarding in 2676) [ClassicSimilarity], result of:
            0.08562292 = score(doc=2676,freq=1.0), product of:
              0.1844118 = queryWeight, product of:
                1.0158087 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.018328067 = queryNorm
              0.46430284 = fieldWeight in 2676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.023185574 = weight(abstract_txt:topics in 2676) [ClassicSimilarity], result of:
            0.023185574 = score(doc=2676,freq=1.0), product of:
              0.09724872 = queryWeight, product of:
                1.0432167 = boost
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.018328067 = queryNorm
              0.23841521 = fieldWeight in 2676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.0258845 = weight(abstract_txt:proposed in 2676) [ClassicSimilarity], result of:
            0.0258845 = score(doc=2676,freq=1.0), product of:
              0.1198015 = queryWeight, product of:
                1.4181081 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.018328067 = queryNorm
              0.21606156 = fieldWeight in 2676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.026185105 = weight(abstract_txt:approach in 2676) [ClassicSimilarity], result of:
            0.026185105 = score(doc=2676,freq=2.0), product of:
              0.105464965 = queryWeight, product of:
                1.5363908 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.018328067 = queryNorm
              0.24828249 = fieldWeight in 2676, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.038665067 = weight(abstract_txt:model in 2676) [ClassicSimilarity], result of:
            0.038665067 = score(doc=2676,freq=3.0), product of:
              0.11946868 = queryWeight, product of:
                1.6352141 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.018328067 = queryNorm
              0.32364187 = fieldWeight in 2676, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.090522446 = weight(abstract_txt:sentences in 2676) [ClassicSimilarity], result of:
            0.090522446 = score(doc=2676,freq=1.0), product of:
              0.2760196 = queryWeight, product of:
                2.152525 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.018328067 = queryNorm
              0.3279566 = fieldWeight in 2676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.093485005 = weight(abstract_txt:hidden in 2676) [ClassicSimilarity], result of:
            0.093485005 = score(doc=2676,freq=1.0), product of:
              0.28200948 = queryWeight, product of:
                2.1757555 = boost
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.018328067 = queryNorm
              0.33149597 = fieldWeight in 2676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.093499616 = weight(abstract_txt:document in 2676) [ClassicSimilarity], result of:
            0.093499616 = score(doc=2676,freq=5.0), product of:
              0.207808 = queryWeight, product of:
                2.6413403 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.018328067 = queryNorm
              0.4499327 = fieldWeight in 2676, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
        0.32 = coord(8/25)