Document (#41674)

Author
Doval, Y.
Gómez-Rodríguez, C.
Title
Comparing neural- and N-gram-based language models for word segmentation
Source
Journal of the Association for Information Science and Technology. 70(2019) no.2, pp. 187-197
Year
2019
Abstract
Word segmentation is the task of inserting or deleting word boundary characters in order to separate character sequences that correspond to words in some language. In this article we propose an approach based on a beam search algorithm and a language model working at the byte/character level, the latter component implemented either as an n-gram model or a recurrent neural network. The resulting system analyzes text input that has no word boundaries one token at a time, where a token can be a character or a byte, and uses the information gathered by the language model to determine whether a boundary must be placed at the current position. Our aim is to use this system as a preprocessing step for a microtext normalization system, which means that it needs to cope effectively with the data sparsity present in this kind of text. We also strove to surpass the performance of two readily available word segmentation systems: the well-known and accessible Word Breaker by Microsoft, and the Python module WordSegment by Grant Jenks. The results show that we have met our objectives, and we hope to continue to improve both the precision and the efficiency of our system in the future.
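
The approach described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy character bigram model with add-one smoothing stands in for the n-gram or recurrent language model of the article, and the names CharBigramLM, segment and beam_width are hypothetical, chosen only for this example.

```python
import math
from collections import Counter


class CharBigramLM:
    """Toy character bigram model with add-one smoothing (an assumption;
    the article uses a larger n-gram model or a recurrent network)."""

    def __init__(self, corpus: str):
        self.vocab = set(corpus) | {" "}
        self.bigrams = Counter(zip(corpus, corpus[1:]))
        self.unigrams = Counter(corpus[:-1])

    def logprob(self, prev: str, ch: str) -> float:
        # Add-one smoothing; the extra +1 reserves mass for unseen characters.
        num = self.bigrams[(prev, ch)] + 1
        den = self.unigrams[prev] + len(self.vocab) + 1
        return math.log(num / den)


def segment(text: str, lm: CharBigramLM, beam_width: int = 8) -> str:
    """Read unsegmented text one character at a time and decide, per
    position, whether a boundary (space) goes before the character."""
    if not text:
        return ""
    # Each hypothesis is (log-probability, output produced so far).
    beam = [(lm.logprob(" ", text[0]), text[0])]
    for ch in text[1:]:
        candidates = []
        for score, out in beam:
            prev = out[-1]
            # Hypothesis 1: no boundary at the current position.
            candidates.append((score + lm.logprob(prev, ch), out + ch))
            # Hypothesis 2: a boundary, followed by the character.
            with_space = score + lm.logprob(prev, " ") + lm.logprob(" ", ch)
            candidates.append((with_space, out + " " + ch))
        # Keep only the best beam_width hypotheses.
        candidates.sort(key=lambda c: c[0], reverse=True)
        beam = candidates[:beam_width]
    return beam[0][1]


if __name__ == "__main__":
    lm = CharBigramLM("the cat sat on the mat " * 5)
    print(segment("thecatsat", lm))
```

The beam width trades accuracy for speed: a wider beam keeps more partial segmentations alive at each step, at the cost of more language model queries. What the sketch returns depends entirely on the toy training text, so it should not be read as an indication of the quality reported in the article.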
Content
Cf.: https://onlinelibrary.wiley.com/doi/10.1002/asi.24082.
Theme
Computational linguistics

Similar documents (author)

  1. Cuesta, P.; Gómez, A.M.; Rodríguez, F.J.: Using agents for information retrieval (2003) 4.38
    4.3773203 = sum of:
      4.3773203 = sum of:
        1.8072783 = weight(author_txt:rodríguez in 3743) [ClassicSimilarity], result of:
          1.8072783 = score(doc=3743,freq=1.0), product of:
            0.6202761 = queryWeight, product of:
              7.7697797 = idf(docFreq=49, maxDocs=43556)
              0.079831876 = queryNorm
            2.9136674 = fieldWeight in 3743, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.7697797 = idf(docFreq=49, maxDocs=43556)
              0.375 = fieldNorm(doc=3743)
        2.570042 = weight(author_txt:gómez in 3743) [ClassicSimilarity], result of:
          2.570042 = score(doc=3743,freq=1.0), product of:
            0.7843836 = queryWeight, product of:
              1.1245317 = boost
              8.737364 = idf(docFreq=18, maxDocs=43556)
              0.079831876 = queryNorm
            3.2765114 = fieldWeight in 3743, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.737364 = idf(docFreq=18, maxDocs=43556)
              0.375 = fieldNorm(doc=3743)
    
  2. Vilares, D.; Alonso, M.A.; Gómez-Rodríguez, C.: On the usefulness of lexical and syntactic processing in polarity classification of Twitter messages (2015) 4.38
    4.3773203 = sum of:
      4.3773203 = sum of:
        1.8072783 = weight(author_txt:rodríguez in 4159) [ClassicSimilarity], result of:
          1.8072783 = score(doc=4159,freq=1.0), product of:
            0.6202761 = queryWeight, product of:
              7.7697797 = idf(docFreq=49, maxDocs=43556)
              0.079831876 = queryNorm
            2.9136674 = fieldWeight in 4159, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.7697797 = idf(docFreq=49, maxDocs=43556)
              0.375 = fieldNorm(doc=4159)
        2.570042 = weight(author_txt:gómez in 4159) [ClassicSimilarity], result of:
          2.570042 = score(doc=4159,freq=1.0), product of:
            0.7843836 = queryWeight, product of:
              1.1245317 = boost
              8.737364 = idf(docFreq=18, maxDocs=43556)
              0.079831876 = queryNorm
            3.2765114 = fieldWeight in 4159, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.737364 = idf(docFreq=18, maxDocs=43556)
              0.375 = fieldNorm(doc=4159)
    
  3. Olmeda-Gómez, C.; Perianes-Rodríguez, A.; Ovalle-Perandones, M.A.: Mapas de ciencias multidisciplinares : la biología molecular en la Comunidad de Madrid (2007) 3.65
    3.647767 = sum of:
      3.647767 = sum of:
        1.5060652 = weight(author_txt:rodríguez in 3118) [ClassicSimilarity], result of:
          1.5060652 = score(doc=3118,freq=1.0), product of:
            0.6202761 = queryWeight, product of:
              7.7697797 = idf(docFreq=49, maxDocs=43556)
              0.079831876 = queryNorm
            2.4280562 = fieldWeight in 3118, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.7697797 = idf(docFreq=49, maxDocs=43556)
              0.3125 = fieldNorm(doc=3118)
        2.1417017 = weight(author_txt:gómez in 3118) [ClassicSimilarity], result of:
          2.1417017 = score(doc=3118,freq=1.0), product of:
            0.7843836 = queryWeight, product of:
              1.1245317 = boost
              8.737364 = idf(docFreq=18, maxDocs=43556)
              0.079831876 = queryNorm
            2.7304263 = fieldWeight in 3118, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.737364 = idf(docFreq=18, maxDocs=43556)
              0.3125 = fieldNorm(doc=3118)
    
  4. Gómez Prada, R. Gómez => Gómez Prada, R.: 2.23
    2.2257214 = sum of:
      2.2257214 = product of:
        4.4514427 = sum of:
          4.4514427 = weight(author_txt:gómez in 1405) [ClassicSimilarity], result of:
            4.4514427 = score(doc=1405,freq=3.0), product of:
              0.7843836 = queryWeight, product of:
                1.1245317 = boost
                8.737364 = idf(docFreq=18, maxDocs=43556)
                0.079831876 = queryNorm
              5.675084 = fieldWeight in 1405, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.737364 = idf(docFreq=18, maxDocs=43556)
                0.375 = fieldNorm(doc=1405)
        0.5 = coord(1/2)
    
  5. Gómez, C. Olmeda- -> Olmeda-Gómez, C.: 1.82
    1.8172939 = sum of:
      1.8172939 = product of:
        3.6345878 = sum of:
          3.6345878 = weight(author_txt:gómez in 7444) [ClassicSimilarity], result of:
            3.6345878 = score(doc=7444,freq=2.0), product of:
              0.7843836 = queryWeight, product of:
                1.1245317 = boost
                8.737364 = idf(docFreq=18, maxDocs=43556)
                0.079831876 = queryNorm
              4.6336865 = fieldWeight in 7444, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.737364 = idf(docFreq=18, maxDocs=43556)
                0.375 = fieldNorm(doc=7444)
        0.5 = coord(1/2)
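
The per-term figures in the explanations above follow the classic Lucene TF-IDF scheme (ClassicSimilarity): tf is the square root of the term frequency, idf is 1 + ln(maxDocs / (docFreq + 1)), and a term's score is queryWeight times fieldWeight. The sketch below recomputes the first entry (doc 3743) from the values printed in its tree. The function names are hypothetical helpers for this example, not Lucene API calls, and queryNorm and fieldNorm are simply copied from the listing rather than derived.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq: float) -> float:
    """Term frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """Score of one term in one document: queryWeight * fieldWeight."""
    i = idf(doc_freq, max_docs)
    query_weight = boost * i * query_norm
    field_weight = tf(freq) * i * field_norm
    return query_weight * field_weight


# Values copied from the explanation for doc 3743.
QUERY_NORM = 0.079831876
rodriguez = term_score(1.0, 49, 43556, QUERY_NORM, 0.375)
gomez = term_score(1.0, 18, 43556, QUERY_NORM, 0.375, boost=1.1245317)

if __name__ == "__main__":
    print(round(rodriguez, 4), round(gomez, 4), round(rodriguez + gomez, 4))
```

Small differences in the last digits are expected, because the listing was presumably produced with single-precision arithmetic while Python uses double precision.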
    

Similar documents (content)

  1. Kwok, K.L.: Employing multiple representations for Chinese information retrieval (1999) 0.26
    0.2623774 = sum of:
      0.2623774 = product of:
        0.9370622 = sum of:
          0.072527364 = weight(abstract_txt:characters in 4771) [ClassicSimilarity], result of:
            0.072527364 = score(doc=4771,freq=2.0), product of:
              0.11142293 = queryWeight, product of:
                1.0285336 = boost
                7.3643146 = idf(docFreq=74, maxDocs=43556)
                0.014710376 = queryNorm
              0.65091956 = fieldWeight in 4771, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3643146 = idf(docFreq=74, maxDocs=43556)
                0.0625 = fieldNorm(doc=4771)
          0.027707584 = weight(abstract_txt:system in 4771) [ClassicSimilarity], result of:
            0.027707584 = score(doc=4771,freq=2.0), product of:
              0.09312345 = queryWeight, product of:
                1.8805752 = boost
                3.3662362 = idf(docFreq=4086, maxDocs=43556)
                0.014710376 = queryNorm
              0.29753605 = fieldWeight in 4771, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3662362 = idf(docFreq=4086, maxDocs=43556)
                0.0625 = fieldNorm(doc=4771)
          0.12992884 = weight(abstract_txt:gram in 4771) [ClassicSimilarity], result of:
            0.12992884 = score(doc=4771,freq=1.0), product of:
              0.26089373 = queryWeight, product of:
                2.2257583 = boost
                7.9682307 = idf(docFreq=40, maxDocs=43556)
                0.014710376 = queryNorm
              0.49801442 = fieldWeight in 4771, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9682307 = idf(docFreq=40, maxDocs=43556)
                0.0625 = fieldNorm(doc=4771)
          0.03778687 = weight(abstract_txt:language in 4771) [ClassicSimilarity], result of:
            0.03778687 = score(doc=4771,freq=1.0), product of:
              0.14428812 = queryWeight, product of:
                2.3408654 = boost
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.014710376 = queryNorm
              0.26188484 = fieldWeight in 4771, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.0625 = fieldNorm(doc=4771)
          0.15039487 = weight(abstract_txt:character in 4771) [ClassicSimilarity], result of:
            0.15039487 = score(doc=4771,freq=2.0), product of:
              0.26131785 = queryWeight, product of:
                2.728201 = boost
                6.5113187 = idf(docFreq=175, maxDocs=43556)
                0.014710376 = queryNorm
              0.5755247 = fieldWeight in 4771, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5113187 = idf(docFreq=175, maxDocs=43556)
                0.0625 = fieldNorm(doc=4771)
          0.27070785 = weight(abstract_txt:segmentation in 4771) [ClassicSimilarity], result of:
            0.27070785 = score(doc=4771,freq=2.0), product of:
              0.38667634 = queryWeight, product of:
                3.3186817 = boost
                7.920603 = idf(docFreq=42, maxDocs=43556)
                0.014710376 = queryNorm
              0.700089 = fieldWeight in 4771, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.920603 = idf(docFreq=42, maxDocs=43556)
                0.0625 = fieldNorm(doc=4771)
          0.24800885 = weight(abstract_txt:word in 4771) [ClassicSimilarity], result of:
            0.24800885 = score(doc=4771,freq=4.0), product of:
              0.36474708 = queryWeight, product of:
                4.558298 = boost
                5.4395795 = idf(docFreq=513, maxDocs=43556)
                0.014710376 = queryNorm
              0.67994744 = fieldWeight in 4771, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4395795 = idf(docFreq=513, maxDocs=43556)
                0.0625 = fieldNorm(doc=4771)
        0.28 = coord(7/25)
    
  2. Wang, F.L.; Yang, C.C.: Mining Web data for Chinese segmentation (2007) 0.25
    0.25358877 = sum of:
      0.25358877 = product of:
        1.0566199 = sum of:
          0.052739527 = weight(abstract_txt:sequences in 2602) [ClassicSimilarity], result of:
            0.052739527 = score(doc=2602,freq=1.0), product of:
              0.11352045 = queryWeight, product of:
                1.0381694 = boost
                7.4333076 = idf(docFreq=69, maxDocs=43556)
                0.014710376 = queryNorm
              0.46458173 = fieldWeight in 2602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4333076 = idf(docFreq=69, maxDocs=43556)
                0.0625 = fieldNorm(doc=2602)
          0.0073468024 = weight(abstract_txt:this in 2602) [ClassicSimilarity], result of:
            0.0073468024 = score(doc=2602,freq=1.0), product of:
              0.048424914 = queryWeight, product of:
                1.3561121 = boost
                2.4274454 = idf(docFreq=10449, maxDocs=43556)
                0.014710376 = queryNorm
              0.15171534 = fieldWeight in 2602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4274454 = idf(docFreq=10449, maxDocs=43556)
                0.0625 = fieldNorm(doc=2602)
          0.06544878 = weight(abstract_txt:language in 2602) [ClassicSimilarity], result of:
            0.06544878 = score(doc=2602,freq=3.0), product of:
              0.14428812 = queryWeight, product of:
                2.3408654 = boost
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.014710376 = queryNorm
              0.45359784 = fieldWeight in 2602, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.0625 = fieldNorm(doc=2602)
          0.15039487 = weight(abstract_txt:character in 2602) [ClassicSimilarity], result of:
            0.15039487 = score(doc=2602,freq=2.0), product of:
              0.26131785 = queryWeight, product of:
                2.728201 = boost
                6.5113187 = idf(docFreq=175, maxDocs=43556)
                0.014710376 = queryNorm
              0.5755247 = fieldWeight in 2602, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5113187 = idf(docFreq=175, maxDocs=43556)
                0.0625 = fieldNorm(doc=2602)
          0.60532117 = weight(abstract_txt:segmentation in 2602) [ClassicSimilarity], result of:
            0.60532117 = score(doc=2602,freq=10.0), product of:
              0.38667634 = queryWeight, product of:
                3.3186817 = boost
                7.920603 = idf(docFreq=42, maxDocs=43556)
                0.014710376 = queryNorm
              1.5654466 = fieldWeight in 2602, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                7.920603 = idf(docFreq=42, maxDocs=43556)
                0.0625 = fieldNorm(doc=2602)
          0.17536873 = weight(abstract_txt:word in 2602) [ClassicSimilarity], result of:
            0.17536873 = score(doc=2602,freq=2.0), product of:
              0.36474708 = queryWeight, product of:
                4.558298 = boost
                5.4395795 = idf(docFreq=513, maxDocs=43556)
                0.014710376 = queryNorm
              0.48079544 = fieldWeight in 2602, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4395795 = idf(docFreq=513, maxDocs=43556)
                0.0625 = fieldNorm(doc=2602)
        0.24 = coord(6/25)
    
  3. Peng, F.; Huang, X.: Machine learning for Asian language text classification (2007) 0.24
    0.23589224 = sum of:
      0.23589224 = product of:
        0.98288435 = sum of:
          0.010389947 = weight(abstract_txt:this in 2829) [ClassicSimilarity], result of:
            0.010389947 = score(doc=2829,freq=2.0), product of:
              0.048424914 = queryWeight, product of:
                1.3561121 = boost
                2.4274454 = idf(docFreq=10449, maxDocs=43556)
                0.014710376 = queryNorm
              0.21455789 = fieldWeight in 2829, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4274454 = idf(docFreq=10449, maxDocs=43556)
                0.0625 = fieldNorm(doc=2829)
          0.024692861 = weight(abstract_txt:model in 2829) [ClassicSimilarity], result of:
            0.024692861 = score(doc=2829,freq=1.0), product of:
              0.09871989 = queryWeight, product of:
                1.6768497 = boost
                4.002089 = idf(docFreq=2163, maxDocs=43556)
                0.014710376 = queryNorm
              0.25013056 = fieldWeight in 2829, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.002089 = idf(docFreq=2163, maxDocs=43556)
                0.0625 = fieldNorm(doc=2829)
          0.11777098 = weight(abstract_txt:boundary in 2829) [ClassicSimilarity], result of:
            0.11777098 = score(doc=2829,freq=1.0), product of:
              0.24435364 = queryWeight, product of:
                2.154049 = boost
                7.7115107 = idf(docFreq=52, maxDocs=43556)
                0.014710376 = queryNorm
              0.48196942 = fieldWeight in 2829, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7115107 = idf(docFreq=52, maxDocs=43556)
                0.0625 = fieldNorm(doc=2829)
          0.07557374 = weight(abstract_txt:language in 2829) [ClassicSimilarity], result of:
            0.07557374 = score(doc=2829,freq=4.0), product of:
              0.14428812 = queryWeight, product of:
                2.3408654 = boost
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.014710376 = queryNorm
              0.5237697 = fieldWeight in 2829, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.0625 = fieldNorm(doc=2829)
          0.506448 = weight(abstract_txt:segmentation in 2829) [ClassicSimilarity], result of:
            0.506448 = score(doc=2829,freq=7.0), product of:
              0.38667634 = queryWeight, product of:
                3.3186817 = boost
                7.920603 = idf(docFreq=42, maxDocs=43556)
                0.014710376 = queryNorm
              1.3097465 = fieldWeight in 2829, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.920603 = idf(docFreq=42, maxDocs=43556)
                0.0625 = fieldNorm(doc=2829)
          0.24800885 = weight(abstract_txt:word in 2829) [ClassicSimilarity], result of:
            0.24800885 = score(doc=2829,freq=4.0), product of:
              0.36474708 = queryWeight, product of:
                4.558298 = boost
                5.4395795 = idf(docFreq=513, maxDocs=43556)
                0.014710376 = queryNorm
              0.67994744 = fieldWeight in 2829, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4395795 = idf(docFreq=513, maxDocs=43556)
                0.0625 = fieldNorm(doc=2829)
        0.24 = coord(6/25)
    
  4. Lee, K.H.; Ng, M.K.M.; Lu, Q.: Text segmentation for Chinese spell checking (1999) 0.20
    0.200797 = sum of:
      0.200797 = product of:
        0.8366542 = sum of:
          0.051284596 = weight(abstract_txt:characters in 4911) [ClassicSimilarity], result of:
            0.051284596 = score(doc=4911,freq=1.0), product of:
              0.11142293 = queryWeight, product of:
                1.0285336 = boost
                7.3643146 = idf(docFreq=74, maxDocs=43556)
                0.014710376 = queryNorm
              0.46026966 = fieldWeight in 4911, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3643146 = idf(docFreq=74, maxDocs=43556)
                0.0625 = fieldNorm(doc=4911)
          0.010389947 = weight(abstract_txt:this in 4911) [ClassicSimilarity], result of:
            0.010389947 = score(doc=4911,freq=2.0), product of:
              0.048424914 = queryWeight, product of:
                1.3561121 = boost
                2.4274454 = idf(docFreq=10449, maxDocs=43556)
                0.014710376 = queryNorm
              0.21455789 = fieldWeight in 4911, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4274454 = idf(docFreq=10449, maxDocs=43556)
                0.0625 = fieldNorm(doc=4911)
          0.03778687 = weight(abstract_txt:language in 4911) [ClassicSimilarity], result of:
            0.03778687 = score(doc=4911,freq=1.0), product of:
              0.14428812 = queryWeight, product of:
                2.3408654 = boost
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.014710376 = queryNorm
              0.26188484 = fieldWeight in 4911, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.0625 = fieldNorm(doc=4911)
          0.10634524 = weight(abstract_txt:character in 4911) [ClassicSimilarity], result of:
            0.10634524 = score(doc=4911,freq=1.0), product of:
              0.26131785 = queryWeight, product of:
                2.728201 = boost
                6.5113187 = idf(docFreq=175, maxDocs=43556)
                0.014710376 = queryNorm
              0.40695742 = fieldWeight in 4911, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5113187 = idf(docFreq=175, maxDocs=43556)
                0.0625 = fieldNorm(doc=4911)
          0.38283873 = weight(abstract_txt:segmentation in 4911) [ClassicSimilarity], result of:
            0.38283873 = score(doc=4911,freq=4.0), product of:
              0.38667634 = queryWeight, product of:
                3.3186817 = boost
                7.920603 = idf(docFreq=42, maxDocs=43556)
                0.014710376 = queryNorm
              0.99007535 = fieldWeight in 4911, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.920603 = idf(docFreq=42, maxDocs=43556)
                0.0625 = fieldNorm(doc=4911)
          0.24800885 = weight(abstract_txt:word in 4911) [ClassicSimilarity], result of:
            0.24800885 = score(doc=4911,freq=4.0), product of:
              0.36474708 = queryWeight, product of:
                4.558298 = boost
                5.4395795 = idf(docFreq=513, maxDocs=43556)
                0.014710376 = queryNorm
              0.67994744 = fieldWeight in 4911, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4395795 = idf(docFreq=513, maxDocs=43556)
                0.0625 = fieldNorm(doc=4911)
        0.24 = coord(6/25)
    
  5. Yang, C.C.; Li, K.W.: A heuristic method based on a statistical approach for Chinese text segmentation (2005) 0.20
    0.19557093 = sum of:
      0.19557093 = product of:
        0.97785467 = sum of:
          0.051284596 = weight(abstract_txt:characters in 578) [ClassicSimilarity], result of:
            0.051284596 = score(doc=578,freq=1.0), product of:
              0.11142293 = queryWeight, product of:
                1.0285336 = boost
                7.3643146 = idf(docFreq=74, maxDocs=43556)
                0.014710376 = queryNorm
              0.46026966 = fieldWeight in 578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3643146 = idf(docFreq=74, maxDocs=43556)
                0.0625 = fieldNorm(doc=578)
          0.010389947 = weight(abstract_txt:this in 578) [ClassicSimilarity], result of:
            0.010389947 = score(doc=578,freq=2.0), product of:
              0.048424914 = queryWeight, product of:
                1.3561121 = boost
                2.4274454 = idf(docFreq=10449, maxDocs=43556)
                0.014710376 = queryNorm
              0.21455789 = fieldWeight in 578, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4274454 = idf(docFreq=10449, maxDocs=43556)
                0.0625 = fieldNorm(doc=578)
          0.16655332 = weight(abstract_txt:boundary in 578) [ClassicSimilarity], result of:
            0.16655332 = score(doc=578,freq=2.0), product of:
              0.24435364 = queryWeight, product of:
                2.154049 = boost
                7.7115107 = idf(docFreq=52, maxDocs=43556)
                0.014710376 = queryNorm
              0.68160766 = fieldWeight in 578, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7115107 = idf(docFreq=52, maxDocs=43556)
                0.0625 = fieldNorm(doc=578)
          0.5742581 = weight(abstract_txt:segmentation in 578) [ClassicSimilarity], result of:
            0.5742581 = score(doc=578,freq=9.0), product of:
              0.38667634 = queryWeight, product of:
                3.3186817 = boost
                7.920603 = idf(docFreq=42, maxDocs=43556)
                0.014710376 = queryNorm
              1.485113 = fieldWeight in 578, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.920603 = idf(docFreq=42, maxDocs=43556)
                0.0625 = fieldNorm(doc=578)
          0.17536873 = weight(abstract_txt:word in 578) [ClassicSimilarity], result of:
            0.17536873 = score(doc=578,freq=2.0), product of:
              0.36474708 = queryWeight, product of:
                4.558298 = boost
                5.4395795 = idf(docFreq=513, maxDocs=43556)
                0.014710376 = queryNorm
              0.48079544 = fieldWeight in 578, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4395795 = idf(docFreq=513, maxDocs=43556)
                0.0625 = fieldNorm(doc=578)
        0.2 = coord(5/25)
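
The document-level scores in the content listing combine the per-term scores with a coordination factor: the term scores are summed, and the sum is multiplied by coord(matched/total), the fraction of query terms found in the document. The sketch below checks this for the last entry (doc 578), using the five term scores printed in its tree. The function name document_score is a hypothetical helper for this example.

```python
def document_score(term_scores, matched: int, total_clauses: int) -> float:
    """Sum the per-term scores and scale by the coordination factor."""
    coord = matched / total_clauses
    return coord * sum(term_scores)


# Per-term scores copied from the explanation for doc 578:
# characters, this, boundary, segmentation, word.
scores_578 = [0.051284596, 0.010389947, 0.16655332, 0.5742581, 0.17536873]

if __name__ == "__main__":
    # 5 of the 25 query terms matched, hence coord(5/25) = 0.2.
    print(round(document_score(scores_578, 5, 25), 6))
```

The coordination factor explains why a document that uses "segmentation" nine times still ranks below documents that match more of the distinct query terms.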