Document (#39774)

Author
Denning, J.
Pera, M.S.
Ng, Y.-K.
Title
¬A readability level prediction tool for K-12 books
Source
Journal of the Association for Information Science and Technology. 67(2016) no.3, S.550-565
Year
2016
Abstract
The readability levels of books identify suitable reading materials. Unfortunately, the majority of published books are assigned a readability level range, which is not useful to readers who look for books at a particular grade level. Existing readability formulas/analysis tools require at least an excerpt of a book to estimate its readability level, which is a severe constraint, since copyright laws prohibit book contents from being made publicly accessible. To alleviate the constraint, we have developed TRoLL which relies on publicly accessible online book metadata, in addition to using a book's snippet, if it is available, to predict its readability level. Based on a multi-dimensional regression analysis, TRoLL determines the grade level of any book instantly, even without a sample of its text, and considers its topical suitability, which is unique. Furthermore, TRoLL is a significant contribution to the educational community, since its computed book readability levels can enrich K-12 readers' book selections and aid parents, teachers, and librarians in locating reading materials suitable for their K-12 readers, which can be a time-consuming and frustrating task that does not always yield a quality outcome. Conducted empirical studies have verified the prediction accuracy of TRoLL and demonstrated its superiority over well-known readability formulas/analysis tools.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23417/abstract.

Similar documents (author)

  1. Pera, M. Soledad => Soledad Pera, M.: 5.03
    5.026267 = sum of:
      5.026267 = weight(author_txt:pera in 3876) [ClassicSimilarity], result of:
        5.026267 = fieldWeight in 3876, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.47762 = idf(docFreq=8, maxDocs=43254)
          0.375 = fieldNorm(doc=3876)
    
  2. Pera, M.S.; Ng, Y.-K.: SpamED : a spam E-mail detection approach based on phrase similarity (2009) 4.15
    4.1464586 = sum of:
      4.1464586 = weight(author_txt:pera in 4722) [ClassicSimilarity], result of:
        4.1464586 = fieldWeight in 4722, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.47762 = idf(docFreq=8, maxDocs=43254)
          0.4375 = fieldNorm(doc=4722)
    
  3. Azpiazu, I.M.; Soledad Pera, M.: Is cross-lingual readability assessment possible? (2020) 4.15
    4.1464586 = sum of:
      4.1464586 = weight(author_txt:pera in 869) [ClassicSimilarity], result of:
        4.1464586 = fieldWeight in 869, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.47762 = idf(docFreq=8, maxDocs=43254)
          0.4375 = fieldNorm(doc=869)
    
  4. Pera, M.S.; Lund, W.; Ng, Y.-K.: ¬A sophisticated library search strategy using folksonomies and similarity matching (2009) 3.55
    3.5541077 = sum of:
      3.5541077 = weight(author_txt:pera in 4940) [ClassicSimilarity], result of:
        3.5541077 = fieldWeight in 4940, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.47762 = idf(docFreq=8, maxDocs=43254)
          0.375 = fieldNorm(doc=4940)
    
  5. Soledad Pera, M.; Ng, Y.-K.: Recommending books to be exchanged online in the absence of wish lists (2018) 3.55
    3.5541077 = sum of:
      3.5541077 = weight(author_txt:pera in 183) [ClassicSimilarity], result of:
        3.5541077 = fieldWeight in 183, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.47762 = idf(docFreq=8, maxDocs=43254)
          0.375 = fieldNorm(doc=183)
    

Similar documents (content)

  1. Leroy, G.; Miller, T.; Rosemblat, G.; Browne, A.: ¬A balanced approach to health information evaluation : a vocabulary-based naïve Bayes classifier and readability formulas (2008) 0.41
    0.41007727 = sum of:
      0.41007727 = product of:
        1.4645617 = sum of:
          0.025827795 = weight(abstract_txt:since in 3999) [ClassicSimilarity], result of:
            0.025827795 = score(doc=3999,freq=1.0), product of:
              0.06749142 = queryWeight, product of:
                1.1221329 = boost
                4.898338 = idf(docFreq=876, maxDocs=43254)
                0.01227879 = queryNorm
              0.38268265 = fieldWeight in 3999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.898338 = idf(docFreq=876, maxDocs=43254)
                0.078125 = fieldNorm(doc=3999)
          0.044640303 = weight(abstract_txt:levels in 3999) [ClassicSimilarity], result of:
            0.044640303 = score(doc=3999,freq=2.0), product of:
              0.077149265 = queryWeight, product of:
                1.1997366 = boost
                5.2370934 = idf(docFreq=624, maxDocs=43254)
                0.01227879 = queryNorm
              0.5786225 = fieldWeight in 3999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2370934 = idf(docFreq=624, maxDocs=43254)
                0.078125 = fieldNorm(doc=3999)
          0.013783793 = weight(abstract_txt:which in 3999) [ClassicSimilarity], result of:
            0.013783793 = score(doc=3999,freq=1.0), product of:
              0.060267456 = queryWeight, product of:
                1.6766076 = boost
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.01227879 = queryNorm
              0.22871038 = fieldWeight in 3999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.078125 = fieldNorm(doc=3999)
          0.14768913 = weight(abstract_txt:grade in 3999) [ClassicSimilarity], result of:
            0.14768913 = score(doc=3999,freq=2.0), product of:
              0.17129554 = queryWeight, product of:
                1.7876934 = boost
                7.803644 = idf(docFreq=47, maxDocs=43254)
                0.01227879 = queryNorm
              0.862189 = fieldWeight in 3999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.803644 = idf(docFreq=47, maxDocs=43254)
                0.078125 = fieldNorm(doc=3999)
          0.17601398 = weight(abstract_txt:formulas in 3999) [ClassicSimilarity], result of:
            0.17601398 = score(doc=3999,freq=2.0), product of:
              0.19255072 = queryWeight, product of:
                1.8953637 = boost
                8.273647 = idf(docFreq=29, maxDocs=43254)
                0.01227879 = queryNorm
              0.91411746 = fieldWeight in 3999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.273647 = idf(docFreq=29, maxDocs=43254)
                0.078125 = fieldNorm(doc=3999)
          0.060921308 = weight(abstract_txt:level in 3999) [ClassicSimilarity], result of:
            0.060921308 = score(doc=3999,freq=1.0), product of:
              0.17248192 = queryWeight, product of:
                3.10708 = boost
                4.5210114 = idf(docFreq=1278, maxDocs=43254)
                0.01227879 = queryNorm
              0.353204 = fieldWeight in 3999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5210114 = idf(docFreq=1278, maxDocs=43254)
                0.078125 = fieldNorm(doc=3999)
          0.99568546 = weight(abstract_txt:readability in 3999) [ClassicSimilarity], result of:
            0.99568546 = score(doc=3999,freq=4.0), product of:
              0.7702029 = queryWeight, product of:
                7.5814548 = boost
                8.273647 = idf(docFreq=29, maxDocs=43254)
                0.01227879 = queryNorm
              1.2927574 = fieldWeight in 3999, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.273647 = idf(docFreq=29, maxDocs=43254)
                0.078125 = fieldNorm(doc=3999)
        0.28 = coord(7/25)
    
  2. Azpiazu, I.M.; Soledad Pera, M.: Is cross-lingual readability assessment possible? (2020) 0.21
    0.2062712 = sum of:
      0.2062712 = product of:
        1.031356 = sum of:
          0.022095822 = weight(abstract_txt:levels in 869) [ClassicSimilarity], result of:
            0.022095822 = score(doc=869,freq=1.0), product of:
              0.077149265 = queryWeight, product of:
                1.1997366 = boost
                5.2370934 = idf(docFreq=624, maxDocs=43254)
                0.01227879 = queryNorm
              0.28640354 = fieldWeight in 869, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2370934 = idf(docFreq=624, maxDocs=43254)
                0.0546875 = fieldNorm(doc=869)
          0.10334418 = weight(abstract_txt:prediction in 869) [ClassicSimilarity], result of:
            0.10334418 = score(doc=869,freq=3.0), product of:
              0.14960356 = queryWeight, product of:
                1.6706711 = boost
                7.2928185 = idf(docFreq=79, maxDocs=43254)
                0.01227879 = queryNorm
              0.6907869 = fieldWeight in 869, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.2928185 = idf(docFreq=79, maxDocs=43254)
                0.0546875 = fieldNorm(doc=869)
          0.009648656 = weight(abstract_txt:which in 869) [ClassicSimilarity], result of:
            0.009648656 = score(doc=869,freq=1.0), product of:
              0.060267456 = queryWeight, product of:
                1.6766076 = boost
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.01227879 = queryNorm
              0.16009727 = fieldWeight in 869, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.0546875 = fieldNorm(doc=869)
          0.042644914 = weight(abstract_txt:level in 869) [ClassicSimilarity], result of:
            0.042644914 = score(doc=869,freq=1.0), product of:
              0.17248192 = queryWeight, product of:
                3.10708 = boost
                4.5210114 = idf(docFreq=1278, maxDocs=43254)
                0.01227879 = queryNorm
              0.24724281 = fieldWeight in 869, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5210114 = idf(docFreq=1278, maxDocs=43254)
                0.0546875 = fieldNorm(doc=869)
          0.85362244 = weight(abstract_txt:readability in 869) [ClassicSimilarity], result of:
            0.85362244 = score(doc=869,freq=6.0), product of:
              0.7702029 = queryWeight, product of:
                7.5814548 = boost
                8.273647 = idf(docFreq=29, maxDocs=43254)
                0.01227879 = queryNorm
              1.1083086 = fieldWeight in 869, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                8.273647 = idf(docFreq=29, maxDocs=43254)
                0.0546875 = fieldNorm(doc=869)
        0.2 = coord(5/25)
    
  3. Collins-Thompson, K.; Callan, J.: Predicting reading difficulty with statistical language models (2005) 0.17
    0.17246014 = sum of:
      0.17246014 = product of:
        0.8623007 = sum of:
          0.035712242 = weight(abstract_txt:levels in 580) [ClassicSimilarity], result of:
            0.035712242 = score(doc=580,freq=2.0), product of:
              0.077149265 = queryWeight, product of:
                1.1997366 = boost
                5.2370934 = idf(docFreq=624, maxDocs=43254)
                0.01227879 = queryNorm
              0.46289802 = fieldWeight in 580, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2370934 = idf(docFreq=624, maxDocs=43254)
                0.0625 = fieldNorm(doc=580)
          0.07626786 = weight(abstract_txt:reading in 580) [ClassicSimilarity], result of:
            0.07626786 = score(doc=580,freq=4.0), product of:
              0.10154801 = queryWeight, product of:
                1.376435 = boost
                6.008418 = idf(docFreq=288, maxDocs=43254)
                0.01227879 = queryNorm
              0.75105226 = fieldWeight in 580, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.008418 = idf(docFreq=288, maxDocs=43254)
                0.0625 = fieldNorm(doc=580)
          0.11815131 = weight(abstract_txt:grade in 580) [ClassicSimilarity], result of:
            0.11815131 = score(doc=580,freq=2.0), product of:
              0.17129554 = queryWeight, product of:
                1.7876934 = boost
                7.803644 = idf(docFreq=47, maxDocs=43254)
                0.01227879 = queryNorm
              0.6897512 = fieldWeight in 580, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.803644 = idf(docFreq=47, maxDocs=43254)
                0.0625 = fieldNorm(doc=580)
          0.06892459 = weight(abstract_txt:level in 580) [ClassicSimilarity], result of:
            0.06892459 = score(doc=580,freq=2.0), product of:
              0.17248192 = queryWeight, product of:
                3.10708 = boost
                4.5210114 = idf(docFreq=1278, maxDocs=43254)
                0.01227879 = queryNorm
              0.3996047 = fieldWeight in 580, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5210114 = idf(docFreq=1278, maxDocs=43254)
                0.0625 = fieldNorm(doc=580)
          0.5632447 = weight(abstract_txt:readability in 580) [ClassicSimilarity], result of:
            0.5632447 = score(doc=580,freq=2.0), product of:
              0.7702029 = queryWeight, product of:
                7.5814548 = boost
                8.273647 = idf(docFreq=29, maxDocs=43254)
                0.01227879 = queryNorm
              0.731294 = fieldWeight in 580, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.273647 = idf(docFreq=29, maxDocs=43254)
                0.0625 = fieldNorm(doc=580)
        0.2 = coord(5/25)
    
  4. Kauchak, D.; Leroy, G.; Hogue, A.: Measuring text difficulty using parse-tree frequency (2017) 0.17
    0.1703032 = sum of:
      0.1703032 = product of:
        0.85151595 = sum of:
          0.01848732 = weight(abstract_txt:analysis in 5251) [ClassicSimilarity], result of:
            0.01848732 = score(doc=5251,freq=2.0), product of:
              0.05693772 = queryWeight, product of:
                1.262309 = boost
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.01227879 = queryNorm
              0.3246937 = fieldWeight in 5251, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.0625 = fieldNorm(doc=5251)
          0.14081118 = weight(abstract_txt:formulas in 5251) [ClassicSimilarity], result of:
            0.14081118 = score(doc=5251,freq=2.0), product of:
              0.19255072 = queryWeight, product of:
                1.8953637 = boost
                8.273647 = idf(docFreq=29, maxDocs=43254)
                0.01227879 = queryNorm
              0.731294 = fieldWeight in 5251, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.273647 = idf(docFreq=29, maxDocs=43254)
                0.0625 = fieldNorm(doc=5251)
          0.060048148 = weight(abstract_txt:readers in 5251) [ClassicSimilarity], result of:
            0.060048148 = score(doc=5251,freq=1.0), product of:
              0.15733567 = queryWeight, product of:
                2.0983562 = boost
                6.1065006 = idf(docFreq=261, maxDocs=43254)
                0.01227879 = queryNorm
              0.3816563 = fieldWeight in 5251, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1065006 = idf(docFreq=261, maxDocs=43254)
                0.0625 = fieldNorm(doc=5251)
          0.06892459 = weight(abstract_txt:level in 5251) [ClassicSimilarity], result of:
            0.06892459 = score(doc=5251,freq=2.0), product of:
              0.17248192 = queryWeight, product of:
                3.10708 = boost
                4.5210114 = idf(docFreq=1278, maxDocs=43254)
                0.01227879 = queryNorm
              0.3996047 = fieldWeight in 5251, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5210114 = idf(docFreq=1278, maxDocs=43254)
                0.0625 = fieldNorm(doc=5251)
          0.5632447 = weight(abstract_txt:readability in 5251) [ClassicSimilarity], result of:
            0.5632447 = score(doc=5251,freq=2.0), product of:
              0.7702029 = queryWeight, product of:
                7.5814548 = boost
                8.273647 = idf(docFreq=29, maxDocs=43254)
                0.01227879 = queryNorm
              0.731294 = fieldWeight in 5251, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.273647 = idf(docFreq=29, maxDocs=43254)
                0.0625 = fieldNorm(doc=5251)
        0.2 = coord(5/25)
    
  5. Jiang, Z.; Gu, Q.; Yin, Y.; Wang, J.; Chen, D.: GRAW+ : a two-view graph propagation method with word coupling for readability assessment (2019) 0.14
    0.14261197 = sum of:
      0.14261197 = product of:
        0.8913248 = sum of:
          0.02525237 = weight(abstract_txt:levels in 219) [ClassicSimilarity], result of:
            0.02525237 = score(doc=219,freq=1.0), product of:
              0.077149265 = queryWeight, product of:
                1.1997366 = boost
                5.2370934 = idf(docFreq=624, maxDocs=43254)
                0.01227879 = queryNorm
              0.32731834 = fieldWeight in 219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2370934 = idf(docFreq=624, maxDocs=43254)
                0.0625 = fieldNorm(doc=219)
          0.053929523 = weight(abstract_txt:reading in 219) [ClassicSimilarity], result of:
            0.053929523 = score(doc=219,freq=2.0), product of:
              0.10154801 = queryWeight, product of:
                1.376435 = boost
                6.008418 = idf(docFreq=288, maxDocs=43254)
                0.01227879 = queryNorm
              0.53107417 = fieldWeight in 219, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.008418 = idf(docFreq=288, maxDocs=43254)
                0.0625 = fieldNorm(doc=219)
          0.015594581 = weight(abstract_txt:which in 219) [ClassicSimilarity], result of:
            0.015594581 = score(doc=219,freq=2.0), product of:
              0.060267456 = queryWeight, product of:
                1.6766076 = boost
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.01227879 = queryNorm
              0.25875625 = fieldWeight in 219, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.0625 = fieldNorm(doc=219)
          0.79654837 = weight(abstract_txt:readability in 219) [ClassicSimilarity], result of:
            0.79654837 = score(doc=219,freq=4.0), product of:
              0.7702029 = queryWeight, product of:
                7.5814548 = boost
                8.273647 = idf(docFreq=29, maxDocs=43254)
                0.01227879 = queryNorm
              1.0342059 = fieldWeight in 219, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.273647 = idf(docFreq=29, maxDocs=43254)
                0.0625 = fieldNorm(doc=219)
        0.16 = coord(4/25)