Document (#35317)

Author
Yang, Y.
Lu, Q.
Zhao, T.
Title
¬A delimiter-based general approach for Chinese term extraction
Source
Journal of the American Society for Information Science and Technology. 61(2010) no.1, pp.111-125
Year
2009
Abstract
This article addresses a two-step approach for term extraction. In the first step on term candidate extraction, a new delimiter-based approach is proposed to identify features of the delimiters of term candidates rather than those of the term candidates themselves. This delimiter-based method is much more stable and domain independent than the previous approaches. In the second step on term verification, an algorithm using link analysis is applied to calculate the relevance between term candidates and the sentences from which the terms are extracted. All information is obtained from the working domain corpus without the need for prior domain knowledge. The approach is not targeted at any specific domain and there is no need for extensive training when applying it to new domains. In other words, the method is not domain dependent and it is especially useful for resource-limited domains. Evaluations of Chinese text in two different domains show quite significant improvements over existing techniques and also verify its efficiency and its relatively domain-independent nature. The proposed method is also very effective for extracting new terms so that it can serve as an efficient tool for updating domain knowledge, especially for expanding lexicons.
Theme
Computational linguistics

Similar documents (author)

  1. Zhao, L.: Save space for "newcomers" : analyzing problems in book number assignment under the LCC system (2004) 1.78
    1.7770264 = sum of:
      1.7770264 = product of:
        3.5540528 = sum of:
          3.5540528 = weight(author_txt:zhao in 5082) [ClassicSimilarity], result of:
            3.5540528 = score(doc=5082,freq=1.0), product of:
              0.74338055 = queryWeight, product of:
                1.0542294 = boost
                7.649493 = idf(docFreq=55, maxDocs=43254)
                0.09218142 = queryNorm
              4.7809334 = fieldWeight in 5082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.649493 = idf(docFreq=55, maxDocs=43254)
                0.625 = fieldNorm(doc=5082)
        0.5 = coord(1/2)
    
  2. Zhao, L.: How librarians used e-resources : an analysis of citations in CCQ (2006) 1.78
    1.7770264 = sum of:
      1.7770264 = product of:
        3.5540528 = sum of:
          3.5540528 = weight(author_txt:zhao in 767) [ClassicSimilarity], result of:
            3.5540528 = score(doc=767,freq=1.0), product of:
              0.74338055 = queryWeight, product of:
                1.0542294 = boost
                7.649493 = idf(docFreq=55, maxDocs=43254)
                0.09218142 = queryNorm
              4.7809334 = fieldWeight in 767, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.649493 = idf(docFreq=55, maxDocs=43254)
                0.625 = fieldNorm(doc=767)
        0.5 = coord(1/2)
    
  3. Zhao, D.: Challenges of scholarly publications on the Web to the evaluation of science : a comparison of author visibility on the Web and in print journals (2005) 1.78
    1.7770264 = sum of:
      1.7770264 = product of:
        3.5540528 = sum of:
          3.5540528 = weight(author_txt:zhao in 3066) [ClassicSimilarity], result of:
            3.5540528 = score(doc=3066,freq=1.0), product of:
              0.74338055 = queryWeight, product of:
                1.0542294 = boost
                7.649493 = idf(docFreq=55, maxDocs=43254)
                0.09218142 = queryNorm
              4.7809334 = fieldWeight in 3066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.649493 = idf(docFreq=55, maxDocs=43254)
                0.625 = fieldNorm(doc=3066)
        0.5 = coord(1/2)
    
  4. Yang, S.C.: ¬An interpretive and situated approach to an evaluation of Perseus digital libraries (2001) 1.52
    1.5166608 = sum of:
      1.5166608 = product of:
        3.0333216 = sum of:
          3.0333216 = weight(author_txt:yang in 1934) [ClassicSimilarity], result of:
            3.0333216 = score(doc=1934,freq=1.0), product of:
              0.6688688 = queryWeight, product of:
                7.2560043 = idf(docFreq=82, maxDocs=43254)
                0.09218142 = queryNorm
              4.5350027 = fieldWeight in 1934, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2560043 = idf(docFreq=82, maxDocs=43254)
                0.625 = fieldNorm(doc=1934)
        0.5 = coord(1/2)
    
  5. Yang, K.: Information retrieval on the Web (2004) 1.52
    1.5166608 = sum of:
      1.5166608 = product of:
        3.0333216 = sum of:
          3.0333216 = weight(author_txt:yang in 279) [ClassicSimilarity], result of:
            3.0333216 = score(doc=279,freq=1.0), product of:
              0.6688688 = queryWeight, product of:
                7.2560043 = idf(docFreq=82, maxDocs=43254)
                0.09218142 = queryNorm
              4.5350027 = fieldWeight in 279, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2560043 = idf(docFreq=82, maxDocs=43254)
                0.625 = fieldNorm(doc=279)
        0.5 = coord(1/2)
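
The score trees above all follow the same arithmetic, which can be reproduced as a minimal sketch. The function names are hypothetical and not part of the catalog software. The idf formula, 1 + ln(maxDocs / (docFreq + 1)), and tf = sqrt(freq) are assumptions chosen because they reproduce the listed values; other Lucene versions may compute idf slightly differently, and the listed values are 32-bit floats, so the results agree only to about five or six significant digits.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """idf as it appears in the explain output: 1 + ln(maxDocs / (docFreq + 1)).

    Assumption: this variant is inferred from the listed numbers.
    """
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_term_score(freq: float, doc_freq: int, max_docs: int,
                       boost: float, query_norm: float,
                       field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    tf = math.sqrt(freq)                      # tf(freq) = sqrt(termFreq)
    query_weight = boost * idf * query_norm   # "queryWeight, product of:"
    field_weight = tf * idf * field_norm      # "fieldWeight in <doc>, product of:"
    return query_weight * field_weight


# Entry 1 (author_txt:zhao in doc 5082), values copied from the tree above.
zhao_term = classic_term_score(freq=1.0, doc_freq=55, max_docs=43254,
                               boost=1.0542294, query_norm=0.09218142,
                               field_norm=0.625)
zhao_final = zhao_term * 0.5   # coord(1/2): one of two query terms matched

# Entry 4 (author_txt:yang in doc 1934); no boost line is listed, so 1.0 is assumed.
yang_term = classic_term_score(freq=1.0, doc_freq=82, max_docs=43254,
                               boost=1.0, query_norm=0.09218142,
                               field_norm=0.625)
yang_final = yang_term * 0.5

print(round(zhao_final, 4), round(yang_final, 4))
```

The three "Zhao" entries share one score because every input (term frequency, idf, boost, fieldNorm) is identical; only the document number differs.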
    

Similar documents (content)

  1. Nagy T., I.: Detecting multiword expressions and named entities in natural language texts (2014) 0.27
    0.27282217 = sum of:
      0.27282217 = product of:
        0.6200504 = sum of:
          0.008388047 = weight(abstract_txt:knowledge in 3001) [ClassicSimilarity], result of:
            0.008388047 = score(doc=3001,freq=1.0), product of:
              0.059989087 = queryWeight, product of:
                3.5795512 = idf(docFreq=3278, maxDocs=43254)
                0.01675883 = queryNorm
              0.13982622 = fieldWeight in 3001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5795512 = idf(docFreq=3278, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.035838235 = weight(abstract_txt:candidate in 3001) [ClassicSimilarity], result of:
            0.035838235 = score(doc=3001,freq=1.0), product of:
              0.12536782 = queryWeight, product of:
                1.0222142 = boost
                7.318136 = idf(docFreq=77, maxDocs=43254)
                0.01675883 = queryNorm
              0.2858647 = fieldWeight in 3001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.318136 = idf(docFreq=77, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.010890628 = weight(abstract_txt:than in 3001) [ClassicSimilarity], result of:
            0.010890628 = score(doc=3001,freq=1.0), product of:
              0.07139486 = queryWeight, product of:
                1.0909312 = boost
                3.905044 = idf(docFreq=2367, maxDocs=43254)
                0.01675883 = queryNorm
              0.15254079 = fieldWeight in 3001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.905044 = idf(docFreq=2367, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.025473244 = weight(abstract_txt:based in 3001) [ClassicSimilarity], result of:
            0.025473244 = score(doc=3001,freq=8.0), product of:
              0.07200372 = queryWeight, product of:
                1.3417975 = boost
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.01675883 = queryNorm
              0.35377678 = fieldWeight in 3001, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.056362193 = weight(abstract_txt:method in 3001) [ClassicSimilarity], result of:
            0.056362193 = score(doc=3001,freq=5.0), product of:
              0.14299823 = queryWeight, product of:
                1.8909273 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.01675883 = queryNorm
              0.39414608 = fieldWeight in 3001, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.027406 = weight(abstract_txt:approach in 3001) [ClassicSimilarity], result of:
            0.027406 = score(doc=3001,freq=2.0), product of:
              0.13208762 = queryWeight, product of:
                2.0985045 = boost
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.01675883 = queryNorm
              0.20748349 = fieldWeight in 3001, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.048522327 = weight(abstract_txt:domains in 3001) [ClassicSimilarity], result of:
            0.048522327 = score(doc=3001,freq=1.0), product of:
              0.22128731 = queryWeight, product of:
                2.3522732 = boost
                5.613388 = idf(docFreq=428, maxDocs=43254)
                0.01675883 = queryNorm
              0.21927297 = fieldWeight in 3001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.613388 = idf(docFreq=428, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.06317965 = weight(abstract_txt:step in 3001) [ClassicSimilarity], result of:
            0.06317965 = score(doc=3001,freq=1.0), product of:
              0.26386407 = queryWeight, product of:
                2.5686185 = boost
                6.1296678 = idf(docFreq=255, maxDocs=43254)
                0.01675883 = queryNorm
              0.23944014 = fieldWeight in 3001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1296678 = idf(docFreq=255, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.11384512 = weight(abstract_txt:extraction in 3001) [ClassicSimilarity], result of:
            0.11384512 = score(doc=3001,freq=3.0), product of:
              0.2709139 = queryWeight, product of:
                2.6027062 = boost
                6.2110133 = idf(docFreq=235, maxDocs=43254)
                0.01675883 = queryNorm
              0.4202262 = fieldWeight in 3001, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2110133 = idf(docFreq=235, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.13250893 = weight(abstract_txt:candidates in 3001) [ClassicSimilarity], result of:
            0.13250893 = score(doc=3001,freq=1.0), product of:
              0.43234012 = queryWeight, product of:
                3.2879279 = boost
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.01675883 = queryNorm
              0.30649233 = fieldWeight in 3001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
          0.09763607 = weight(abstract_txt:domain in 3001) [ClassicSimilarity], result of:
            0.09763607 = score(doc=3001,freq=2.0), product of:
              0.37129396 = queryWeight, product of:
                4.6543264 = boost
                4.760114 = idf(docFreq=1006, maxDocs=43254)
                0.01675883 = queryNorm
              0.26296163 = fieldWeight in 3001, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.760114 = idf(docFreq=1006, maxDocs=43254)
                0.0390625 = fieldNorm(doc=3001)
        0.44 = coord(11/25)
    
  2. Haas, S.W.: Disciplinary variation in automatic sublanguage term identification (1997) 0.25
    0.25488725 = sum of:
      0.25488725 = product of:
        0.91031164 = sum of:
          0.030180994 = weight(abstract_txt:than in 569) [ClassicSimilarity], result of:
            0.030180994 = score(doc=569,freq=3.0), product of:
              0.07139486 = queryWeight, product of:
                1.0909312 = boost
                3.905044 = idf(docFreq=2367, maxDocs=43254)
                0.01675883 = queryNorm
              0.42273343 = fieldWeight in 569, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.905044 = idf(docFreq=2367, maxDocs=43254)
                0.0625 = fieldNorm(doc=569)
          0.051737156 = weight(abstract_txt:terms in 569) [ClassicSimilarity], result of:
            0.051737156 = score(doc=569,freq=7.0), product of:
              0.07709994 = queryWeight, product of:
                1.133681 = boost
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.01675883 = queryNorm
              0.6710401 = fieldWeight in 569, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.0625 = fieldNorm(doc=569)
          0.057034526 = weight(abstract_txt:method in 569) [ClassicSimilarity], result of:
            0.057034526 = score(doc=569,freq=2.0), product of:
              0.14299823 = queryWeight, product of:
                1.8909273 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.01675883 = queryNorm
              0.39884776 = fieldWeight in 569, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.0625 = fieldNorm(doc=569)
          0.10108744 = weight(abstract_txt:step in 569) [ClassicSimilarity], result of:
            0.10108744 = score(doc=569,freq=1.0), product of:
              0.26386407 = queryWeight, product of:
                2.5686185 = boost
                6.1296678 = idf(docFreq=255, maxDocs=43254)
                0.01675883 = queryNorm
              0.38310423 = fieldWeight in 569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1296678 = idf(docFreq=255, maxDocs=43254)
                0.0625 = fieldNorm(doc=569)
          0.14872663 = weight(abstract_txt:extraction in 569) [ClassicSimilarity], result of:
            0.14872663 = score(doc=569,freq=2.0), product of:
              0.2709139 = queryWeight, product of:
                2.6027062 = boost
                6.2110133 = idf(docFreq=235, maxDocs=43254)
                0.01675883 = queryNorm
              0.5489812 = fieldWeight in 569, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2110133 = idf(docFreq=235, maxDocs=43254)
                0.0625 = fieldNorm(doc=569)
          0.29225656 = weight(abstract_txt:domain in 569) [ClassicSimilarity], result of:
            0.29225656 = score(doc=569,freq=7.0), product of:
              0.37129396 = queryWeight, product of:
                4.6543264 = boost
                4.760114 = idf(docFreq=1006, maxDocs=43254)
                0.01675883 = queryNorm
              0.7871299 = fieldWeight in 569, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.760114 = idf(docFreq=1006, maxDocs=43254)
                0.0625 = fieldNorm(doc=569)
          0.22928827 = weight(abstract_txt:term in 569) [ClassicSimilarity], result of:
            0.22928827 = score(doc=569,freq=4.0), product of:
              0.38060597 = queryWeight, product of:
                4.71233 = boost
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.01675883 = queryNorm
              0.6024295 = fieldWeight in 569, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.0625 = fieldNorm(doc=569)
        0.28 = coord(7/25)
    
  3. Li, N.; Sun, J.: Improving Chinese term association from the linguistic perspective (2017) 0.24
    0.24166735 = sum of:
      0.24166735 = product of:
        0.67129815 = sum of:
          0.016776094 = weight(abstract_txt:knowledge in 4846) [ClassicSimilarity], result of:
            0.016776094 = score(doc=4846,freq=1.0), product of:
              0.059989087 = queryWeight, product of:
                3.5795512 = idf(docFreq=3278, maxDocs=43254)
                0.01675883 = queryNorm
              0.27965245 = fieldWeight in 4846, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5795512 = idf(docFreq=3278, maxDocs=43254)
                0.078125 = fieldNorm(doc=4846)
          0.03456834 = weight(abstract_txt:terms in 4846) [ClassicSimilarity], result of:
            0.03456834 = score(doc=4846,freq=2.0), product of:
              0.07709994 = queryWeight, product of:
                1.133681 = boost
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.01675883 = queryNorm
              0.44835752 = fieldWeight in 4846, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.078125 = fieldNorm(doc=4846)
          0.051223136 = weight(abstract_txt:proposed in 4846) [ClassicSimilarity], result of:
            0.051223136 = score(doc=4846,freq=2.0), product of:
              0.10021033 = queryWeight, product of:
                1.2924689 = boost
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.01675883 = queryNorm
              0.51115626 = fieldWeight in 4846, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.078125 = fieldNorm(doc=4846)
          0.025473244 = weight(abstract_txt:based in 4846) [ClassicSimilarity], result of:
            0.025473244 = score(doc=4846,freq=2.0), product of:
              0.07200372 = queryWeight, product of:
                1.3417975 = boost
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.01675883 = queryNorm
              0.35377678 = fieldWeight in 4846, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.078125 = fieldNorm(doc=4846)
          0.092463866 = weight(abstract_txt:chinese in 4846) [ClassicSimilarity], result of:
            0.092463866 = score(doc=4846,freq=1.0), product of:
              0.18718012 = queryWeight, product of:
                1.7664189 = boost
                6.322987 = idf(docFreq=210, maxDocs=43254)
                0.01675883 = queryNorm
              0.49398336 = fieldWeight in 4846, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.322987 = idf(docFreq=210, maxDocs=43254)
                0.078125 = fieldNorm(doc=4846)
          0.07129316 = weight(abstract_txt:method in 4846) [ClassicSimilarity], result of:
            0.07129316 = score(doc=4846,freq=2.0), product of:
              0.14299823 = queryWeight, product of:
                1.8909273 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.01675883 = queryNorm
              0.4985597 = fieldWeight in 4846, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.078125 = fieldNorm(doc=4846)
          0.03875794 = weight(abstract_txt:approach in 4846) [ClassicSimilarity], result of:
            0.03875794 = score(doc=4846,freq=1.0), product of:
              0.13208762 = queryWeight, product of:
                2.0985045 = boost
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.01675883 = queryNorm
              0.29342598 = fieldWeight in 4846, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.078125 = fieldNorm(doc=4846)
          0.13807826 = weight(abstract_txt:domain in 4846) [ClassicSimilarity], result of:
            0.13807826 = score(doc=4846,freq=1.0), product of:
              0.37129396 = queryWeight, product of:
                4.6543264 = boost
                4.760114 = idf(docFreq=1006, maxDocs=43254)
                0.01675883 = queryNorm
              0.37188393 = fieldWeight in 4846, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.760114 = idf(docFreq=1006, maxDocs=43254)
                0.078125 = fieldNorm(doc=4846)
          0.2026641 = weight(abstract_txt:term in 4846) [ClassicSimilarity], result of:
            0.2026641 = score(doc=4846,freq=2.0), product of:
              0.38060597 = queryWeight, product of:
                4.71233 = boost
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.01675883 = queryNorm
              0.5324775 = fieldWeight in 4846, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.078125 = fieldNorm(doc=4846)
        0.36 = coord(9/25)
    
  4. Terada, A.; Tokunaga, T.; Tanaka, H.: Automatic expansion of abbreviations by using context and character information (2004) 0.24
    0.23859146 = sum of:
      0.23859146 = product of:
        0.7455983 = sum of:
          0.08109266 = weight(abstract_txt:candidate in 4561) [ClassicSimilarity], result of:
            0.08109266 = score(doc=4561,freq=2.0), product of:
              0.12536782 = queryWeight, product of:
                1.0222142 = boost
                7.318136 = idf(docFreq=77, maxDocs=43254)
                0.01675883 = queryNorm
              0.64683795 = fieldWeight in 4561, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.318136 = idf(docFreq=77, maxDocs=43254)
                0.0625 = fieldNorm(doc=4561)
          0.06897797 = weight(abstract_txt:calculate in 4561) [ClassicSimilarity], result of:
            0.06897797 = score(doc=4561,freq=1.0), product of:
              0.14180186 = queryWeight, product of:
                1.0871509 = boost
                7.783025 = idf(docFreq=48, maxDocs=43254)
                0.01675883 = queryNorm
              0.48643905 = fieldWeight in 4561, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.783025 = idf(docFreq=48, maxDocs=43254)
                0.0625 = fieldNorm(doc=4561)
          0.017425004 = weight(abstract_txt:than in 4561) [ClassicSimilarity], result of:
            0.017425004 = score(doc=4561,freq=1.0), product of:
              0.07139486 = queryWeight, product of:
                1.0909312 = boost
                3.905044 = idf(docFreq=2367, maxDocs=43254)
                0.01675883 = queryNorm
              0.24406525 = fieldWeight in 4561, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.905044 = idf(docFreq=2367, maxDocs=43254)
                0.0625 = fieldNorm(doc=4561)
          0.028976185 = weight(abstract_txt:proposed in 4561) [ClassicSimilarity], result of:
            0.028976185 = score(doc=4561,freq=1.0), product of:
              0.10021033 = queryWeight, product of:
                1.2924689 = boost
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.01675883 = queryNorm
              0.28915367 = fieldWeight in 4561, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.0625 = fieldNorm(doc=4561)
          0.014409843 = weight(abstract_txt:based in 4561) [ClassicSimilarity], result of:
            0.014409843 = score(doc=4561,freq=1.0), product of:
              0.07200372 = queryWeight, product of:
                1.3417975 = boost
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.01675883 = queryNorm
              0.20012636 = fieldWeight in 4561, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.0625 = fieldNorm(doc=4561)
          0.057034526 = weight(abstract_txt:method in 4561) [ClassicSimilarity], result of:
            0.057034526 = score(doc=4561,freq=2.0), product of:
              0.14299823 = queryWeight, product of:
                1.8909273 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.01675883 = queryNorm
              0.39884776 = fieldWeight in 4561, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.0625 = fieldNorm(doc=4561)
          0.3672195 = weight(abstract_txt:candidates in 4561) [ClassicSimilarity], result of:
            0.3672195 = score(doc=4561,freq=3.0), product of:
              0.43234012 = queryWeight, product of:
                3.2879279 = boost
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.01675883 = queryNorm
              0.84937644 = fieldWeight in 4561, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.0625 = fieldNorm(doc=4561)
          0.110462606 = weight(abstract_txt:domain in 4561) [ClassicSimilarity], result of:
            0.110462606 = score(doc=4561,freq=1.0), product of:
              0.37129396 = queryWeight, product of:
                4.6543264 = boost
                4.760114 = idf(docFreq=1006, maxDocs=43254)
                0.01675883 = queryNorm
              0.29750714 = fieldWeight in 4561, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.760114 = idf(docFreq=1006, maxDocs=43254)
                0.0625 = fieldNorm(doc=4561)
        0.32 = coord(8/25)
    
  5. Yang, T.-H.; Hsieh, Y.-L.; Liu, S.-H.; Chang, Y.-C.; Hsu, W.-L.: ¬A flexible template generation and matching method with applications for publication reference metadata extraction (2021) 0.23
    0.23245326 = sum of:
      0.23245326 = product of:
        0.5811331 = sum of:
          0.013420876 = weight(abstract_txt:knowledge in 1065) [ClassicSimilarity], result of:
            0.013420876 = score(doc=1065,freq=1.0), product of:
              0.059989087 = queryWeight, product of:
                3.5795512 = idf(docFreq=3278, maxDocs=43254)
                0.01675883 = queryNorm
              0.22372195 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5795512 = idf(docFreq=3278, maxDocs=43254)
                0.0625 = fieldNorm(doc=1065)
          0.017425004 = weight(abstract_txt:than in 1065) [ClassicSimilarity], result of:
            0.017425004 = score(doc=1065,freq=1.0), product of:
              0.07139486 = queryWeight, product of:
                1.0909312 = boost
                3.905044 = idf(docFreq=2367, maxDocs=43254)
                0.01675883 = queryNorm
              0.24406525 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.905044 = idf(docFreq=2367, maxDocs=43254)
                0.0625 = fieldNorm(doc=1065)
          0.021558056 = weight(abstract_txt:need in 1065) [ClassicSimilarity], result of:
            0.021558056 = score(doc=1065,freq=1.0), product of:
              0.08227946 = queryWeight, product of:
                1.1711421 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.01675883 = queryNorm
              0.2620102 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.0625 = fieldNorm(doc=1065)
          0.03222139 = weight(abstract_txt:based in 1065) [ClassicSimilarity], result of:
            0.03222139 = score(doc=1065,freq=5.0), product of:
              0.07200372 = queryWeight, product of:
                1.3417975 = boost
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.01675883 = queryNorm
              0.44749618 = fieldWeight in 1065, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.0625 = fieldNorm(doc=1065)
          0.057593375 = weight(abstract_txt:independent in 1065) [ClassicSimilarity], result of:
            0.057593375 = score(doc=1065,freq=1.0), product of:
              0.15841636 = queryWeight, product of:
                1.6250393 = boost
                5.8169117 = idf(docFreq=349, maxDocs=43254)
                0.01675883 = queryNorm
              0.36355698 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8169117 = idf(docFreq=349, maxDocs=43254)
                0.0625 = fieldNorm(doc=1065)
          0.031006351 = weight(abstract_txt:approach in 1065) [ClassicSimilarity], result of:
            0.031006351 = score(doc=1065,freq=1.0), product of:
              0.13208762 = queryWeight, product of:
                2.0985045 = boost
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.01675883 = queryNorm
              0.23474078 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.0625 = fieldNorm(doc=1065)
          0.07763572 = weight(abstract_txt:domains in 1065) [ClassicSimilarity], result of:
            0.07763572 = score(doc=1065,freq=1.0), product of:
              0.22128731 = queryWeight, product of:
                2.3522732 = boost
                5.613388 = idf(docFreq=428, maxDocs=43254)
                0.01675883 = queryNorm
              0.35083675 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.613388 = idf(docFreq=428, maxDocs=43254)
                0.0625 = fieldNorm(doc=1065)
          0.105165616 = weight(abstract_txt:extraction in 1065) [ClassicSimilarity], result of:
            0.105165616 = score(doc=1065,freq=1.0), product of:
              0.2709139 = queryWeight, product of:
                2.6027062 = boost
                6.2110133 = idf(docFreq=235, maxDocs=43254)
                0.01675883 = queryNorm
              0.38818833 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2110133 = idf(docFreq=235, maxDocs=43254)
                0.0625 = fieldNorm(doc=1065)
          0.110462606 = weight(abstract_txt:domain in 1065) [ClassicSimilarity], result of:
            0.110462606 = score(doc=1065,freq=1.0), product of:
              0.37129396 = queryWeight, product of:
                4.6543264 = boost
                4.760114 = idf(docFreq=1006, maxDocs=43254)
                0.01675883 = queryNorm
              0.29750714 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.760114 = idf(docFreq=1006, maxDocs=43254)
                0.0625 = fieldNorm(doc=1065)
          0.11464413 = weight(abstract_txt:term in 1065) [ClassicSimilarity], result of:
            0.11464413 = score(doc=1065,freq=1.0), product of:
              0.38060597 = queryWeight, product of:
                4.71233 = boost
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.01675883 = queryNorm
              0.30121475 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.0625 = fieldNorm(doc=1065)
        0.4 = coord(10/25)
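
For the content-based matches, each document's final score is the sum of its per-term scores multiplied by the coord factor (matched query terms divided by the 25 terms in the query). The sketch below checks this for entry 1 (doc 3001) using the eleven per-term scores listed in its tree. The helper name `combine_clauses` is hypothetical, and the numbers are copied from the explain output rather than recomputed.

```python
def combine_clauses(clause_scores: list[float], total_query_terms: int) -> float:
    """Sum the per-term scores and scale by coord = matched / total."""
    coord = len(clause_scores) / total_query_terms
    return sum(clause_scores) * coord


# Per-term scores for doc 3001, in the order they appear in the tree:
# knowledge, candidate, than, based, method, approach,
# domains, step, extraction, candidates, domain
doc_3001_clauses = [
    0.008388047, 0.035838235, 0.010890628, 0.025473244,
    0.056362193, 0.027406, 0.048522327, 0.06317965,
    0.11384512, 0.13250893, 0.09763607,
]

doc_3001_score = combine_clauses(doc_3001_clauses, total_query_terms=25)
print(round(doc_3001_score, 4))
```

Because coord rewards documents that match more distinct query terms, doc 3001 (11 of 25 terms, raw sum about 0.62) ranks above doc 569 (7 of 25 terms, raw sum about 0.91).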