Document (#37505)

Author
McArthur, D.
Crompton, H.
Title
Understanding public-access cyberlearning projects using text mining and topic analysis
Source
Journal of the American Society for Information Science and Technology. 63(2012) no.11, pp. 2146-2152
Year
2012
Abstract
The federal government has encouraged open access to publicly funded federal science research results, but it is unclear what knowledge can be gleaned from them and how the knowledge can be used to improve scientific research and shape federal research policies. In this article, we present the results of a preliminary study of cyberlearning projects funded by the National Science Foundation (NSF) that address these issues. Our work demonstrates that text-mining tools can be used to partially automate the process of finding NSF's cyberlearning awards and characterizing the fine-grained topics implicit in award abstracts. The methodology we have established to assess NSF's cyberlearning investments should generalize to other areas of research and other repositories of public-access documents.

Similar documents (content)

  1. Zia, L.L.: The NSF National Science, Technology, Engineering, and Mathematics Education Digital Library (NSDL) Program : new projects from fiscal year 2004 (2005) 0.19
    0.19172488 = sum of:
      0.19172488 = product of:
        0.5325691 = sum of:
          0.010465114 = weight(abstract_txt:other in 1221) [ClassicSimilarity], result of:
            0.010465114 = score(doc=1221,freq=1.0), product of:
              0.07609921 = queryWeight, product of:
                1.010933 = boost
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.021382276 = queryNorm
              0.13751936 = fieldWeight in 1221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1221)
          0.07403862 = weight(abstract_txt:encouraged in 1221) [ClassicSimilarity], result of:
            0.07403862 = score(doc=1221,freq=2.0), product of:
              0.1766728 = queryWeight, product of:
                1.0891863 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.021382276 = queryNorm
              0.41907197 = fieldWeight in 1221, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1221)
          0.027607841 = weight(abstract_txt:science in 1221) [ClassicSimilarity], result of:
            0.027607841 = score(doc=1221,freq=4.0), product of:
              0.09152768 = queryWeight, product of:
                1.108686 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.021382276 = queryNorm
              0.3016338 = fieldWeight in 1221, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1221)
          0.0887789 = weight(abstract_txt:award in 1221) [ClassicSimilarity], result of:
            0.0887789 = score(doc=1221,freq=2.0), product of:
              0.19940558 = queryWeight, product of:
                1.1571403 = boost
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.021382276 = queryNorm
              0.44521773 = fieldWeight in 1221, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1221)
          0.14763367 = weight(abstract_txt:awards in 1221) [ClassicSimilarity], result of:
            0.14763367 = score(doc=1221,freq=3.0), product of:
              0.24450666 = queryWeight, product of:
                1.281335 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.021382276 = queryNorm
              0.60380226 = fieldWeight in 1221, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1221)
          0.02347498 = weight(abstract_txt:public in 1221) [ClassicSimilarity], result of:
            0.02347498 = score(doc=1221,freq=1.0), product of:
              0.13040301 = queryWeight, product of:
                1.3233544 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.021382276 = queryNorm
              0.1800187 = fieldWeight in 1221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1221)
          0.11249036 = weight(abstract_txt:projects in 1221) [ClassicSimilarity], result of:
            0.11249036 = score(doc=1221,freq=10.0), product of:
              0.17203932 = queryWeight, product of:
                1.520009 = boost
                5.293313 = idf(docFreq=603, maxDocs=44218)
                0.021382276 = queryNorm
              0.65386426 = fieldWeight in 1221, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.293313 = idf(docFreq=603, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1221)
          0.01750861 = weight(abstract_txt:access in 1221) [ClassicSimilarity], result of:
            0.01750861 = score(doc=1221,freq=1.0), product of:
              0.1227672 = queryWeight, product of:
                1.5726032 = boost
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.021382276 = queryNorm
              0.14261635 = fieldWeight in 1221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1221)
          0.030570967 = weight(abstract_txt:research in 1221) [ClassicSimilarity], result of:
            0.030570967 = score(doc=1221,freq=4.0), product of:
              0.12342797 = queryWeight, product of:
                1.820766 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.021382276 = queryNorm
              0.24768265 = fieldWeight in 1221, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1221)
        0.36 = coord(9/25)
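
The per-term figures in the breakdown above can be reproduced by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function names are hypothetical, and the input values are copied from the "encouraged" entry for doc 1221. Lucene computes in 32-bit floats, so the last digits may differ slightly.

```python
import math

def classic_idf(doc_freq, max_docs):
    # Inverse document frequency as used by ClassicSimilarity (assumed formula).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq):
    # Term-frequency factor: square root of the raw frequency.
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # score = queryWeight * fieldWeight, mirroring the two "product of" branches.
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm
    field_weight = classic_tf(freq) * idf * field_norm
    return query_weight * field_weight

# Values taken from the "encouraged" term in doc 1221.
score = term_score(
    freq=2.0,
    doc_freq=60,
    max_docs=44218,
    boost=1.0891863,
    query_norm=0.021382276,
    field_norm=0.0390625,
)
print(round(score, 6))
```

Terms listed without a boost line (for example "results" in the later entries) correspond to boost=1.0 in this sketch.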
    
  2. McKrell, L.; Green, A.; Harris, K.: Libraries and community development national survey (1997) 0.16
    0.16145393 = sum of:
      0.16145393 = product of:
        0.5766212 = sum of:
          0.024310172 = weight(abstract_txt:results in 2984) [ClassicSimilarity], result of:
            0.024310172 = score(doc=2984,freq=1.0), product of:
              0.07446211 = queryWeight, product of:
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.021382276 = queryNorm
              0.32647708 = fieldWeight in 2984, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.09375 = fieldNorm(doc=2984)
          0.025116276 = weight(abstract_txt:other in 2984) [ClassicSimilarity], result of:
            0.025116276 = score(doc=2984,freq=1.0), product of:
              0.07609921 = queryWeight, product of:
                1.010933 = boost
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.021382276 = queryNorm
              0.33004647 = fieldWeight in 2984, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.09375 = fieldNorm(doc=2984)
          0.15066278 = weight(abstract_txt:award in 2984) [ClassicSimilarity], result of:
            0.15066278 = score(doc=2984,freq=1.0), product of:
              0.19940558 = queryWeight, product of:
                1.1571403 = boost
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.021382276 = queryNorm
              0.7555595 = fieldWeight in 2984, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.09375 = fieldNorm(doc=2984)
          0.079676725 = weight(abstract_txt:public in 2984) [ClassicSimilarity], result of:
            0.079676725 = score(doc=2984,freq=2.0), product of:
              0.13040301 = queryWeight, product of:
                1.3233544 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.021382276 = queryNorm
              0.6110037 = fieldWeight in 2984, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.09375 = fieldNorm(doc=2984)
          0.085374184 = weight(abstract_txt:projects in 2984) [ClassicSimilarity], result of:
            0.085374184 = score(doc=2984,freq=1.0), product of:
              0.17203932 = queryWeight, product of:
                1.520009 = boost
                5.293313 = idf(docFreq=603, maxDocs=44218)
                0.021382276 = queryNorm
              0.4962481 = fieldWeight in 2984, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.293313 = idf(docFreq=603, maxDocs=44218)
                0.09375 = fieldNorm(doc=2984)
          0.05188065 = weight(abstract_txt:research in 2984) [ClassicSimilarity], result of:
            0.05188065 = score(doc=2984,freq=2.0), product of:
              0.12342797 = queryWeight, product of:
                1.820766 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.021382276 = queryNorm
              0.4203314 = fieldWeight in 2984, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.09375 = fieldNorm(doc=2984)
          0.1596004 = weight(abstract_txt:funded in 2984) [ClassicSimilarity], result of:
            0.1596004 = score(doc=2984,freq=1.0), product of:
              0.2610754 = queryWeight, product of:
                1.8724719 = boost
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.021382276 = queryNorm
              0.6113192 = fieldWeight in 2984, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.09375 = fieldNorm(doc=2984)
        0.28 = coord(7/25)
    
  3. Miao, Q.; Li, Q.; Zeng, D.: Fine-grained opinion mining by integrating multiple review sources (2010) 0.13
    0.12869285 = sum of:
      0.12869285 = product of:
        0.5362202 = sum of:
          0.024310172 = weight(abstract_txt:results in 4104) [ClassicSimilarity], result of:
            0.024310172 = score(doc=4104,freq=1.0), product of:
              0.07446211 = queryWeight, product of:
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.021382276 = queryNorm
              0.32647708 = fieldWeight in 4104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.09375 = fieldNorm(doc=4104)
          0.036506224 = weight(abstract_txt:knowledge in 4104) [ClassicSimilarity], result of:
            0.036506224 = score(doc=4104,freq=2.0), product of:
              0.07750171 = queryWeight, product of:
                1.0202062 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.021382276 = queryNorm
              0.4710377 = fieldWeight in 4104, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.09375 = fieldNorm(doc=4104)
          0.106802255 = weight(abstract_txt:fine in 4104) [ClassicSimilarity], result of:
            0.106802255 = score(doc=4104,freq=1.0), product of:
              0.15853322 = queryWeight, product of:
                1.0317571 = boost
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.021382276 = queryNorm
              0.6736901 = fieldWeight in 4104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.09375 = fieldNorm(doc=4104)
          0.14019984 = weight(abstract_txt:grained in 4104) [ClassicSimilarity], result of:
            0.14019984 = score(doc=4104,freq=1.0), product of:
              0.19006334 = queryWeight, product of:
                1.1297088 = boost
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.021382276 = queryNorm
              0.737648 = fieldWeight in 4104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.09375 = fieldNorm(doc=4104)
          0.19171657 = weight(abstract_txt:mining in 4104) [ClassicSimilarity], result of:
            0.19171657 = score(doc=4104,freq=2.0), product of:
              0.23415661 = queryWeight, product of:
                1.7733136 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.021382276 = queryNorm
              0.8187536 = fieldWeight in 4104, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.09375 = fieldNorm(doc=4104)
          0.03668516 = weight(abstract_txt:research in 4104) [ClassicSimilarity], result of:
            0.03668516 = score(doc=4104,freq=1.0), product of:
              0.12342797 = queryWeight, product of:
                1.820766 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.021382276 = queryNorm
              0.2972192 = fieldWeight in 4104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.09375 = fieldNorm(doc=4104)
        0.24 = coord(6/25)
    
  4. Rusch-Feja, D.; Becker, H.J.: Global Info : the German digital libraries project (1999) 0.13
    0.12777494 = sum of:
      0.12777494 = product of:
        0.4563391 = sum of:
          0.012155086 = weight(abstract_txt:results in 1242) [ClassicSimilarity], result of:
            0.012155086 = score(doc=1242,freq=1.0), product of:
              0.07446211 = queryWeight, product of:
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.021382276 = queryNorm
              0.16323854 = fieldWeight in 1242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.046875 = fieldNorm(doc=1242)
          0.017759888 = weight(abstract_txt:other in 1242) [ClassicSimilarity], result of:
            0.017759888 = score(doc=1242,freq=2.0), product of:
              0.07609921 = queryWeight, product of:
                1.010933 = boost
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.021382276 = queryNorm
              0.23337808 = fieldWeight in 1242, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.046875 = fieldNorm(doc=1242)
          0.016564704 = weight(abstract_txt:science in 1242) [ClassicSimilarity], result of:
            0.016564704 = score(doc=1242,freq=1.0), product of:
              0.09152768 = queryWeight, product of:
                1.108686 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.021382276 = queryNorm
              0.18098028 = fieldWeight in 1242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.046875 = fieldNorm(doc=1242)
          0.06036867 = weight(abstract_txt:projects in 1242) [ClassicSimilarity], result of:
            0.06036867 = score(doc=1242,freq=2.0), product of:
              0.17203932 = queryWeight, product of:
                1.520009 = boost
                5.293313 = idf(docFreq=603, maxDocs=44218)
                0.021382276 = queryNorm
              0.3509004 = fieldWeight in 1242, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.293313 = idf(docFreq=603, maxDocs=44218)
                0.046875 = fieldNorm(doc=1242)
          0.029713096 = weight(abstract_txt:access in 1242) [ClassicSimilarity], result of:
            0.029713096 = score(doc=1242,freq=2.0), product of:
              0.1227672 = queryWeight, product of:
                1.5726032 = boost
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.021382276 = queryNorm
              0.24202797 = fieldWeight in 1242, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.046875 = fieldNorm(doc=1242)
          0.041015256 = weight(abstract_txt:research in 1242) [ClassicSimilarity], result of:
            0.041015256 = score(doc=1242,freq=5.0), product of:
              0.12342797 = queryWeight, product of:
                1.820766 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.021382276 = queryNorm
              0.33230114 = fieldWeight in 1242, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.046875 = fieldNorm(doc=1242)
          0.2787624 = weight(abstract_txt:federal in 1242) [ClassicSimilarity], result of:
            0.2787624 = score(doc=1242,freq=3.0), product of:
              0.47706345 = queryWeight, product of:
                3.100031 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.021382276 = queryNorm
              0.58432984 = fieldWeight in 1242, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.046875 = fieldNorm(doc=1242)
        0.28 = coord(7/25)
    
  5. Jayroe, T.J.: A humble servant : the work of Helen L. Brownson and the early years of information science research (2012) 0.12
    0.11841738 = sum of:
      0.11841738 = product of:
        0.5920869 = sum of:
          0.055215683 = weight(abstract_txt:science in 458) [ClassicSimilarity], result of:
            0.055215683 = score(doc=458,freq=4.0), product of:
              0.09152768 = queryWeight, product of:
                1.108686 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.021382276 = queryNorm
              0.6032676 = fieldWeight in 458, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.078125 = fieldNorm(doc=458)
          0.100614436 = weight(abstract_txt:projects in 458) [ClassicSimilarity], result of:
            0.100614436 = score(doc=458,freq=2.0), product of:
              0.17203932 = queryWeight, product of:
                1.520009 = boost
                5.293313 = idf(docFreq=603, maxDocs=44218)
                0.021382276 = queryNorm
              0.584834 = fieldWeight in 458, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.293313 = idf(docFreq=603, maxDocs=44218)
                0.078125 = fieldNorm(doc=458)
          0.03501722 = weight(abstract_txt:access in 458) [ClassicSimilarity], result of:
            0.03501722 = score(doc=458,freq=1.0), product of:
              0.1227672 = queryWeight, product of:
                1.5726032 = boost
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.021382276 = queryNorm
              0.2852327 = fieldWeight in 458, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.078125 = fieldNorm(doc=458)
          0.13300033 = weight(abstract_txt:funded in 458) [ClassicSimilarity], result of:
            0.13300033 = score(doc=458,freq=1.0), product of:
              0.2610754 = queryWeight, product of:
                1.8724719 = boost
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.021382276 = queryNorm
              0.5094326 = fieldWeight in 458, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.078125 = fieldNorm(doc=458)
          0.2682393 = weight(abstract_txt:federal in 458) [ClassicSimilarity], result of:
            0.2682393 = score(doc=458,freq=1.0), product of:
              0.47706345 = queryWeight, product of:
                3.100031 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.021382276 = queryNorm
              0.5622717 = fieldWeight in 458, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.078125 = fieldNorm(doc=458)
        0.2 = coord(5/25)
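
Each document's final similarity score is the sum of its matching term scores multiplied by the coord factor. The sketch below recomputes the total for doc 458 from the listed term scores. It assumes that coord(5/25) means 5 matching terms out of 25 query terms; the variable names are illustrative only.

```python
# Per-term scores copied from the breakdown for doc 458.
term_scores = {
    "science": 0.055215683,
    "projects": 0.100614436,
    "access": 0.03501722,
    "funded": 0.13300033,
    "federal": 0.2682393,
}

# Assumption: the query contains 25 terms in total, as implied by coord(5/25).
QUERY_TERM_COUNT = 25

def coord(matching_terms, query_terms):
    # Fraction of query terms that the document matched.
    return matching_terms / query_terms

raw_sum = sum(term_scores.values())
total = raw_sum * coord(len(term_scores), QUERY_TERM_COUNT)
print(round(raw_sum, 6), round(total, 6))
```

The coord factor rewards documents that match more of the query terms, which is why doc 458 ranks last despite having the largest raw sum of the five entries.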