Document (#44014)

Author
Jiang, Y.
Meng, R.
Huang, Y.
Lu, W.
Liu, J.
Title
Generating keyphrases for readers : a controllable keyphrase generation framework
Source
Journal of the Association for Information Science and Technology. 74(2023) no.7, S.759-774
Year
2023
Abstract
With the wide application of keyphrases in many Information Retrieval (IR) and Natural Language Processing (NLP) tasks, automatic keyphrase prediction has been emerging. However, these statistically important phrases are contributing increasingly less to the related tasks because the end-to-end learning mechanism enables models to learn the important semantic information of the text directly. Similarly, keyphrases are of little help for readers to quickly grasp the paper's main idea because the relationship between the keyphrase and the paper is not explicit to readers. Therefore, we propose to generate keyphrases with specific functions for readers to bridge the semantic gap between them and the information producers, and verify the effectiveness of the keyphrase function for assisting users' comprehension with a user experiment. A controllable keyphrase generation framework (the CKPG) that uses the keyphrase function as a control code to generate categorized keyphrases is proposed and implemented based on Transformer, BART, and T5, respectively. For the Computer Science domain, the Macro-avgs of , , and on the Paper with Code dataset are up to 0.680, 0.535, and 0.558, respectively. Our experimental results indicate the effectiveness of the CKPG models.
Content
Vgl.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24749.
Theme
Automatisches Abstracting
Object
BART
T5

Similar documents (author)

  1. Meng, M.: ¬A conceptual framework for online education programs (1993) 1.35
    1.3503542 = sum of:
      1.3503542 = product of:
        4.0510626 = sum of:
          4.0510626 = weight(author_txt:meng in 7822) [ClassicSimilarity], result of:
            4.0510626 = score(doc=7822,freq=1.0), product of:
              0.70361626 = queryWeight, product of:
                1.2799612 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.05967412 = queryNorm
              5.7574883 = fieldWeight in 7822, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=7822)
        0.33333334 = coord(1/3)
    
  2. Meng, L.: ¬The creation of [the] Chinese Science Citation Database : status quo and future development (1997) 1.35
    1.3503542 = sum of:
      1.3503542 = product of:
        4.0510626 = sum of:
          4.0510626 = weight(author_txt:meng in 954) [ClassicSimilarity], result of:
            4.0510626 = score(doc=954,freq=1.0), product of:
              0.70361626 = queryWeight, product of:
                1.2799612 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.05967412 = queryNorm
              5.7574883 = fieldWeight in 954, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=954)
        0.33333334 = coord(1/3)
    
  3. Jiang, D.: ¬A feasibility study of the outsourcing of cataloging in the academic libraries (1998) 0.97
    0.9745097 = sum of:
      0.9745097 = product of:
        2.9235291 = sum of:
          2.9235291 = weight(author_txt:jiang in 4622) [ClassicSimilarity], result of:
            2.9235291 = score(doc=4622,freq=1.0), product of:
              0.56610227 = queryWeight, product of:
                1.1480911 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.05967412 = queryNorm
              5.164313 = fieldWeight in 4622, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.625 = fieldNorm(doc=4622)
        0.33333334 = coord(1/3)
    
  4. Jiang, S.Y.: Lost in translation : the treatment of Chinese classics in the Library of Congress Classification (2007) 0.97
    0.9745097 = sum of:
      0.9745097 = product of:
        2.9235291 = sum of:
          2.9235291 = weight(author_txt:jiang in 773) [ClassicSimilarity], result of:
            2.9235291 = score(doc=773,freq=1.0), product of:
              0.56610227 = queryWeight, product of:
                1.1480911 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.05967412 = queryNorm
              5.164313 = fieldWeight in 773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.625 = fieldNorm(doc=773)
        0.33333334 = coord(1/3)
    
  5. Jiang, T.: Architektur und Anwendungen des kollaborativen Lernsystems K3 (2008) 0.97
    0.9745097 = sum of:
      0.9745097 = product of:
        2.9235291 = sum of:
          2.9235291 = weight(author_txt:jiang in 1391) [ClassicSimilarity], result of:
            2.9235291 = score(doc=1391,freq=1.0), product of:
              0.56610227 = queryWeight, product of:
                1.1480911 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.05967412 = queryNorm
              5.164313 = fieldWeight in 1391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.625 = fieldNorm(doc=1391)
        0.33333334 = coord(1/3)
    

Similar documents (content)

  1. Wu, Y.-f.B.; Li, Q.; Bot, R.S.; Chen, X.: Finding nuggets in documents : a machine learning approach (2006) 0.40
    0.39718732 = sum of:
      0.39718732 = product of:
        1.6549472 = sum of:
          0.014661205 = weight(abstract_txt:semantic in 5290) [ClassicSimilarity], result of:
            0.014661205 = score(doc=5290,freq=1.0), product of:
              0.052427903 = queryWeight, product of:
                1.2014195 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.009753054 = queryNorm
              0.2796451 = fieldWeight in 5290, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=5290)
          0.01880846 = weight(abstract_txt:because in 5290) [ClassicSimilarity], result of:
            0.01880846 = score(doc=5290,freq=1.0), product of:
              0.061899174 = queryWeight, product of:
                1.3054368 = boost
                4.8617024 = idf(docFreq=929, maxDocs=44218)
                0.009753054 = queryNorm
              0.3038564 = fieldWeight in 5290, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8617024 = idf(docFreq=929, maxDocs=44218)
                0.0625 = fieldNorm(doc=5290)
          0.04014294 = weight(abstract_txt:function in 5290) [ClassicSimilarity], result of:
            0.04014294 = score(doc=5290,freq=2.0), product of:
              0.08144145 = queryWeight, product of:
                1.4973942 = boost
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.009753054 = queryNorm
              0.49290553 = fieldWeight in 5290, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.0625 = fieldNorm(doc=5290)
          0.03608212 = weight(abstract_txt:generate in 5290) [ClassicSimilarity], result of:
            0.03608212 = score(doc=5290,freq=1.0), product of:
              0.09556761 = queryWeight, product of:
                1.622067 = boost
                6.0408955 = idf(docFreq=285, maxDocs=44218)
                0.009753054 = queryNorm
              0.37755597 = fieldWeight in 5290, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0408955 = idf(docFreq=285, maxDocs=44218)
                0.0625 = fieldNorm(doc=5290)
          0.8975749 = weight(abstract_txt:keyphrases in 5290) [ClassicSimilarity], result of:
            0.8975749 = score(doc=5290,freq=7.0), product of:
              0.5777995 = queryWeight, product of:
                6.306262 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.009753054 = queryNorm
              1.5534366 = fieldWeight in 5290, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=5290)
          0.64767754 = weight(abstract_txt:keyphrase in 5290) [ClassicSimilarity], result of:
            0.64767754 = score(doc=5290,freq=3.0), product of:
              0.6551719 = queryWeight, product of:
                7.3561683 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.009753054 = queryNorm
              0.9885613 = fieldWeight in 5290, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.0625 = fieldNorm(doc=5290)
        0.24 = coord(6/25)
    
  2. Pirkola, A.: Constructing topic-specific search keyphrase suggestion tools for Web information retrieval (2010) 0.25
    0.25183997 = sum of:
      0.25183997 = product of:
        1.5739999 = sum of:
          0.023510575 = weight(abstract_txt:because in 4665) [ClassicSimilarity], result of:
            0.023510575 = score(doc=4665,freq=1.0), product of:
              0.061899174 = queryWeight, product of:
                1.3054368 = boost
                4.8617024 = idf(docFreq=929, maxDocs=44218)
                0.009753054 = queryNorm
              0.3798205 = fieldWeight in 4665, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8617024 = idf(docFreq=929, maxDocs=44218)
                0.078125 = fieldNorm(doc=4665)
          0.0063915947 = weight(abstract_txt:with in 4665) [ClassicSimilarity], result of:
            0.0063915947 = score(doc=4665,freq=1.0), product of:
              0.032728456 = queryWeight, product of:
                1.3424286 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.009753054 = queryNorm
              0.19529167 = fieldWeight in 4665, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.078125 = fieldNorm(doc=4665)
          0.7345009 = weight(abstract_txt:keyphrases in 4665) [ClassicSimilarity], result of:
            0.7345009 = score(doc=4665,freq=3.0), product of:
              0.5777995 = queryWeight, product of:
                6.306262 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.009753054 = queryNorm
              1.2712038 = fieldWeight in 4665, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.078125 = fieldNorm(doc=4665)
          0.8095969 = weight(abstract_txt:keyphrase in 4665) [ClassicSimilarity], result of:
            0.8095969 = score(doc=4665,freq=3.0), product of:
              0.6551719 = queryWeight, product of:
                7.3561683 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.009753054 = queryNorm
              1.2357016 = fieldWeight in 4665, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.078125 = fieldNorm(doc=4665)
        0.16 = coord(4/25)
    
  3. Zhang, Y.; Zhang, C.: Enhancing keyphrase extraction from microblogs using human reading time (2021) 0.23
    0.23092858 = sum of:
      0.23092858 = product of:
        1.1546429 = sum of:
          0.012254887 = weight(abstract_txt:important in 237) [ClassicSimilarity], result of:
            0.012254887 = score(doc=237,freq=1.0), product of:
              0.0465217 = queryWeight, product of:
                1.1317258 = boost
                4.2147684 = idf(docFreq=1775, maxDocs=44218)
                0.009753054 = queryNorm
              0.26342303 = fieldWeight in 237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2147684 = idf(docFreq=1775, maxDocs=44218)
                0.0625 = fieldNorm(doc=237)
          0.03273513 = weight(abstract_txt:models in 237) [ClassicSimilarity], result of:
            0.03273513 = score(doc=237,freq=4.0), product of:
              0.056420736 = queryWeight, product of:
                1.2463293 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.009753054 = queryNorm
              0.5801968 = fieldWeight in 237, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=237)
          0.022527827 = weight(abstract_txt:tasks in 237) [ClassicSimilarity], result of:
            0.022527827 = score(doc=237,freq=1.0), product of:
              0.069811806 = queryWeight, product of:
                1.3863659 = boost
                5.1630983 = idf(docFreq=687, maxDocs=44218)
                0.009753054 = queryNorm
              0.32269365 = fieldWeight in 237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1630983 = idf(docFreq=687, maxDocs=44218)
                0.0625 = fieldNorm(doc=237)
          0.33925146 = weight(abstract_txt:keyphrases in 237) [ClassicSimilarity], result of:
            0.33925146 = score(doc=237,freq=1.0), product of:
              0.5777995 = queryWeight, product of:
                6.306262 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.009753054 = queryNorm
              0.5871439 = fieldWeight in 237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=237)
          0.7478736 = weight(abstract_txt:keyphrase in 237) [ClassicSimilarity], result of:
            0.7478736 = score(doc=237,freq=4.0), product of:
              0.6551719 = queryWeight, product of:
                7.3561683 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.009753054 = queryNorm
              1.1414922 = fieldWeight in 237, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.0625 = fieldNorm(doc=237)
        0.2 = coord(5/25)
    
  4. Daudaravicius, V.: ¬A framework for keyphrase extraction from scientific journals (2016) 0.21
    0.21135385 = sum of:
      0.21135385 = product of:
        1.3209616 = sum of:
          0.032725554 = weight(abstract_txt:framework in 2930) [ClassicSimilarity], result of:
            0.032725554 = score(doc=2930,freq=2.0), product of:
              0.05423794 = queryWeight, product of:
                1.2219826 = boost
                4.550903 = idf(docFreq=1268, maxDocs=44218)
                0.009753054 = queryNorm
              0.6033702 = fieldWeight in 2930, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.550903 = idf(docFreq=1268, maxDocs=44218)
                0.09375 = fieldNorm(doc=2930)
          0.007669914 = weight(abstract_txt:with in 2930) [ClassicSimilarity], result of:
            0.007669914 = score(doc=2930,freq=1.0), product of:
              0.032728456 = queryWeight, product of:
                1.3424286 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.009753054 = queryNorm
              0.23435001 = fieldWeight in 2930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.09375 = fieldNorm(doc=2930)
          0.71966094 = weight(abstract_txt:keyphrases in 2930) [ClassicSimilarity], result of:
            0.71966094 = score(doc=2930,freq=2.0), product of:
              0.5777995 = queryWeight, product of:
                6.306262 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.009753054 = queryNorm
              1.2455202 = fieldWeight in 2930, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.09375 = fieldNorm(doc=2930)
          0.5609052 = weight(abstract_txt:keyphrase in 2930) [ClassicSimilarity], result of:
            0.5609052 = score(doc=2930,freq=1.0), product of:
              0.6551719 = queryWeight, product of:
                7.3561683 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.009753054 = queryNorm
              0.85611916 = fieldWeight in 2930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.09375 = fieldNorm(doc=2930)
        0.16 = coord(4/25)
    
  5. Medelyan, O.; Witten, I.H.: Domain-independent automatic keyphrase indexing with small training sets (2008) 0.19
    0.1888786 = sum of:
      0.1888786 = product of:
        0.944393 = sum of:
          0.018326506 = weight(abstract_txt:semantic in 1871) [ClassicSimilarity], result of:
            0.018326506 = score(doc=1871,freq=1.0), product of:
              0.052427903 = queryWeight, product of:
                1.2014195 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.009753054 = queryNorm
              0.34955636 = fieldWeight in 1871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.078125 = fieldNorm(doc=1871)
          0.023510575 = weight(abstract_txt:because in 1871) [ClassicSimilarity], result of:
            0.023510575 = score(doc=1871,freq=1.0), product of:
              0.061899174 = queryWeight, product of:
                1.3054368 = boost
                4.8617024 = idf(docFreq=929, maxDocs=44218)
                0.009753054 = queryNorm
              0.3798205 = fieldWeight in 1871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8617024 = idf(docFreq=929, maxDocs=44218)
                0.078125 = fieldNorm(doc=1871)
          0.011070567 = weight(abstract_txt:with in 1871) [ClassicSimilarity], result of:
            0.011070567 = score(doc=1871,freq=3.0), product of:
              0.032728456 = queryWeight, product of:
                1.3424286 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.009753054 = queryNorm
              0.3382551 = fieldWeight in 1871, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.078125 = fieldNorm(doc=1871)
          0.4240643 = weight(abstract_txt:keyphrases in 1871) [ClassicSimilarity], result of:
            0.4240643 = score(doc=1871,freq=1.0), product of:
              0.5777995 = queryWeight, product of:
                6.306262 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.009753054 = queryNorm
              0.7339299 = fieldWeight in 1871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.078125 = fieldNorm(doc=1871)
          0.46742103 = weight(abstract_txt:keyphrase in 1871) [ClassicSimilarity], result of:
            0.46742103 = score(doc=1871,freq=1.0), product of:
              0.6551719 = queryWeight, product of:
                7.3561683 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.009753054 = queryNorm
              0.71343267 = fieldWeight in 1871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.078125 = fieldNorm(doc=1871)
        0.2 = coord(5/25)