Document (#40325)

Author
Hook, P.A.
Title
Using course-subject co-occurrence (CSCO) to reveal the structure of an academic discipline : a framework to evaluate different inputs of a domain map
Source
Journal of the Association for Information Science and Technology. 68(2017) no.1, pp.182-196
Year
2017
Abstract
This article proposes, exemplifies, and validates the use of course-subject co-occurrence (CSCO) data to generate topic maps of an academic discipline. A CSCO event occurs when two course-subjects are taught in the same academic year by the same teacher. A total of 61,856 CSCO events were extracted from the 2010-11 directory of the Association of American Law Schools and used to visualize the structure of law school education in the United States. Different normalization, ordination (layout), and clustering algorithms were compared, and the best-performing algorithm of each type was used to generate the final map. Validation studies demonstrate that CSCO produces topic maps that are consistent with expert opinion and with four other indicators of the topical similarity of law school course-subjects. This research is the first to use CSCO to produce a visualization of a domain. It is also the first to use an expanded, multi-part gold standard to evaluate the validity of domain maps and the intermediate steps in their creation. The framework used here may be adopted for other studies that compare different inputs of a domain map in order to empirically derive the best maps, as measured against extrinsic sources of topical similarity (gold standards).
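The definition of a CSCO event given in the abstract can be illustrated with a minimal sketch. The record layout `(teacher, academic_year, course_subject)`, the function name `count_csco`, and the sample rows are assumptions made for illustration only; they are not taken from the article or from the directory it used. The sketch also assumes that each unordered pair of subjects taught by one teacher in one year counts as a single event.

```python
from collections import Counter, defaultdict
from itertools import combinations


def count_csco(records):
    """Count co-occurrence events per unordered pair of course-subjects.

    records: iterable of (teacher, academic_year, course_subject) tuples
    (a hypothetical layout, not the article's data format).
    """
    # Collect the distinct subjects each teacher teaches in each year.
    subjects_by_teacher_year = defaultdict(set)
    for teacher, year, subject in records:
        subjects_by_teacher_year[(teacher, year)].add(subject)

    # Each unordered pair inside one (teacher, year) group is one event.
    pair_counts = Counter()
    for subjects in subjects_by_teacher_year.values():
        for pair in combinations(sorted(subjects), 2):
            pair_counts[pair] += 1
    return pair_counts


# Invented sample rows, for illustration only.
records = [
    ("T1", "2010-11", "Contracts"),
    ("T1", "2010-11", "Torts"),
    ("T1", "2010-11", "Evidence"),
    ("T2", "2010-11", "Contracts"),
    ("T2", "2010-11", "Torts"),
    ("T3", "2010-11", "Evidence"),
]
csco = count_csco(records)
```

The resulting pair counts form a symmetric co-occurrence matrix, which is the kind of input that normalization, layout, and clustering steps would then operate on.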
Content
Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23630/full.

Similar documents (content)

  1. Ku, C.-H.; Leroy, G.: A crime reports analysis system to identify related crimes (2011) 0.24
    0.24076042 = sum of:
      0.24076042 = product of:
        0.6687789 = sum of:
          0.011510131 = weight(abstract_txt:that in 4629) [ClassicSimilarity], result of:
            0.011510131 = score(doc=4629,freq=2.0), product of:
              0.054958276 = queryWeight, product of:
                1.0620332 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.021839509 = queryNorm
              0.20943399 = fieldWeight in 4629, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=4629)
          0.032603905 = weight(abstract_txt:same in 4629) [ClassicSimilarity], result of:
            0.032603905 = score(doc=4629,freq=1.0), product of:
              0.110025324 = queryWeight, product of:
                1.0625585 = boost
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.021839509 = queryNorm
              0.2963309 = fieldWeight in 4629, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.0625 = fieldNorm(doc=4629)
          0.03860645 = weight(abstract_txt:best in 4629) [ClassicSimilarity], result of:
            0.03860645 = score(doc=4629,freq=1.0), product of:
              0.123145774 = queryWeight, product of:
                1.1241293 = boost
                5.0160327 = idf(docFreq=796, maxDocs=44218)
                0.021839509 = queryNorm
              0.31350204 = fieldWeight in 4629, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0160327 = idf(docFreq=796, maxDocs=44218)
                0.0625 = fieldNorm(doc=4629)
          0.017394667 = weight(abstract_txt:used in 4629) [ClassicSimilarity], result of:
            0.017394667 = score(doc=4629,freq=1.0), product of:
              0.08284903 = queryWeight, product of:
                1.1292651 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.021839509 = queryNorm
              0.2099562 = fieldWeight in 4629, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=4629)
          0.08016841 = weight(abstract_txt:evaluate in 4629) [ClassicSimilarity], result of:
            0.08016841 = score(doc=4629,freq=3.0), product of:
              0.1389765 = queryWeight, product of:
                1.1942004 = boost
                5.3287 = idf(docFreq=582, maxDocs=44218)
                0.021839509 = queryNorm
              0.5768487 = fieldWeight in 4629, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.3287 = idf(docFreq=582, maxDocs=44218)
                0.0625 = fieldNorm(doc=4629)
          0.031958427 = weight(abstract_txt:different in 4629) [ClassicSimilarity], result of:
            0.031958427 = score(doc=4629,freq=2.0), product of:
              0.09864088 = queryWeight, product of:
                1.2321984 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.021839509 = queryNorm
              0.32398763 = fieldWeight in 4629, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0625 = fieldNorm(doc=4629)
          0.18083356 = weight(abstract_txt:similarity in 4629) [ClassicSimilarity], result of:
            0.18083356 = score(doc=4629,freq=9.0), product of:
              0.16573648 = queryWeight, product of:
                1.3041141 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.021839509 = queryNorm
              1.0910909 = fieldWeight in 4629, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0625 = fieldNorm(doc=4629)
          0.21073055 = weight(abstract_txt:gold in 4629) [ClassicSimilarity], result of:
            0.21073055 = score(doc=4629,freq=2.0), product of:
              0.30300835 = queryWeight, product of:
                1.7633309 = boost
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.021839509 = queryNorm
              0.6954612 = fieldWeight in 4629, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.0625 = fieldNorm(doc=4629)
          0.06497278 = weight(abstract_txt:domain in 4629) [ClassicSimilarity], result of:
            0.06497278 = score(doc=4629,freq=1.0), product of:
              0.21952158 = queryWeight, product of:
                2.1225607 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.021839509 = queryNorm
              0.29597446 = fieldWeight in 4629, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=4629)
        0.36 = coord(9/25)
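
The arithmetic in the explanation tree above can be checked with a short sketch. It assumes the relations visible in the listing: `tf = sqrt(termFreq)`, `queryWeight = boost * idf * queryNorm`, `fieldWeight = tf * idf * fieldNorm`, and a final multiplication by `coord`. The idf formula `1 + ln(maxDocs / (docFreq + 1))` is an inference that reproduces the listed idf values; it is not stated in the listing itself. All numbers are copied from the entry for doc 4629; the function names are illustrative.

```python
import math

QUERY_NORM = 0.021839509  # queryNorm, identical for every term in the listing
MAX_DOCS = 44218
FIELD_NORM = 0.0625       # fieldNorm(doc=4629)
COORD = 9 / 25            # 9 of the 25 query terms matched


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed form; it matches the idf values printed in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq, field_norm=FIELD_NORM):
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


# (term, boost, docFreq, termFreq) as listed for doc 4629
terms = [
    ("that", 1.0620332, 11241, 2.0),
    ("same", 1.0625585, 1048, 1.0),
    ("best", 1.1241293, 796, 1.0),
    ("used", 1.1292651, 4177, 1.0),
    ("evaluate", 1.1942004, 582, 3.0),
    ("different", 1.2321984, 3075, 2.0),
    ("similarity", 1.3041141, 356, 9.0),
    ("gold", 1.7633309, 45, 2.0),
    ("domain", 2.1225607, 1054, 1.0),
]
scores = {term: term_score(b, df, f) for term, b, df, f in terms}
total = sum(scores.values()) * COORD
```

Small differences in the last digits are to be expected, since the listing appears to use single-precision floats while Python uses double precision.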
    
  2. Klavans, R.; Boyack, K.W.: Identifying a better measure of relatedness for mapping science (2006) 0.18
    0.17617752 = sum of:
      0.17617752 = product of:
        0.55055475 = sum of:
          0.028831782 = weight(abstract_txt:framework in 5252) [ClassicSimilarity], result of:
            0.028831782 = score(doc=5252,freq=1.0), product of:
              0.10136637 = queryWeight, product of:
                1.0198903 = boost
                4.550903 = idf(docFreq=1268, maxDocs=44218)
                0.021839509 = queryNorm
              0.28443143 = fieldWeight in 5252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.550903 = idf(docFreq=1268, maxDocs=44218)
                0.0625 = fieldNorm(doc=5252)
          0.014096973 = weight(abstract_txt:that in 5252) [ClassicSimilarity], result of:
            0.014096973 = score(doc=5252,freq=3.0), product of:
              0.054958276 = queryWeight, product of:
                1.0620332 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.021839509 = queryNorm
              0.2565032 = fieldWeight in 5252, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=5252)
          0.066868335 = weight(abstract_txt:best in 5252) [ClassicSimilarity], result of:
            0.066868335 = score(doc=5252,freq=3.0), product of:
              0.123145774 = queryWeight, product of:
                1.1241293 = boost
                5.0160327 = idf(docFreq=796, maxDocs=44218)
                0.021839509 = queryNorm
              0.5430015 = fieldWeight in 5252, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.0160327 = idf(docFreq=796, maxDocs=44218)
                0.0625 = fieldNorm(doc=5252)
          0.024599774 = weight(abstract_txt:used in 5252) [ClassicSimilarity], result of:
            0.024599774 = score(doc=5252,freq=2.0), product of:
              0.08284903 = queryWeight, product of:
                1.1292651 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.021839509 = queryNorm
              0.2969229 = fieldWeight in 5252, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=5252)
          0.046285257 = weight(abstract_txt:evaluate in 5252) [ClassicSimilarity], result of:
            0.046285257 = score(doc=5252,freq=1.0), product of:
              0.1389765 = queryWeight, product of:
                1.1942004 = boost
                5.3287 = idf(docFreq=582, maxDocs=44218)
                0.021839509 = queryNorm
              0.33304375 = fieldWeight in 5252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3287 = idf(docFreq=582, maxDocs=44218)
                0.0625 = fieldNorm(doc=5252)
          0.02259802 = weight(abstract_txt:different in 5252) [ClassicSimilarity], result of:
            0.02259802 = score(doc=5252,freq=1.0), product of:
              0.09864088 = queryWeight, product of:
                1.2321984 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.021839509 = queryNorm
              0.22909386 = fieldWeight in 5252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0625 = fieldNorm(doc=5252)
          0.17059276 = weight(abstract_txt:inputs in 5252) [ClassicSimilarity], result of:
            0.17059276 = score(doc=5252,freq=1.0), product of:
              0.33160415 = queryWeight, product of:
                1.8446608 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.021839509 = queryNorm
              0.514447 = fieldWeight in 5252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.0625 = fieldNorm(doc=5252)
          0.17668188 = weight(abstract_txt:maps in 5252) [ClassicSimilarity], result of:
            0.17668188 = score(doc=5252,freq=2.0), product of:
              0.33944878 = queryWeight, product of:
                2.639421 = boost
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.021839509 = queryNorm
              0.5204964 = fieldWeight in 5252, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.0625 = fieldNorm(doc=5252)
        0.32 = coord(8/25)
    
  3. Wang, P.: An empirical study of knowledge structures of research topics (1999) 0.16
    0.15740357 = sum of:
      0.15740357 = product of:
        0.5621556 = sum of:
          0.008138892 = weight(abstract_txt:that in 6667) [ClassicSimilarity], result of:
            0.008138892 = score(doc=6667,freq=1.0), product of:
              0.054958276 = queryWeight, product of:
                1.0620332 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.021839509 = queryNorm
              0.1480922 = fieldWeight in 6667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=6667)
          0.032603905 = weight(abstract_txt:same in 6667) [ClassicSimilarity], result of:
            0.032603905 = score(doc=6667,freq=1.0), product of:
              0.110025324 = queryWeight, product of:
                1.0625585 = boost
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.021839509 = queryNorm
              0.2963309 = fieldWeight in 6667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.0625 = fieldNorm(doc=6667)
          0.06873394 = weight(abstract_txt:topic in 6667) [ClassicSimilarity], result of:
            0.06873394 = score(doc=6667,freq=3.0), product of:
              0.12542574 = queryWeight, product of:
                1.1344879 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.021839509 = queryNorm
              0.54800504 = fieldWeight in 6667, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=6667)
          0.05884375 = weight(abstract_txt:discipline in 6667) [ClassicSimilarity], result of:
            0.05884375 = score(doc=6667,freq=1.0), product of:
              0.16309719 = queryWeight, product of:
                1.2936887 = boost
                5.7726316 = idf(docFreq=373, maxDocs=44218)
                0.021839509 = queryNorm
              0.36078948 = fieldWeight in 6667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7726316 = idf(docFreq=373, maxDocs=44218)
                0.0625 = fieldNorm(doc=6667)
          0.0674346 = weight(abstract_txt:generate in 6667) [ClassicSimilarity], result of:
            0.0674346 = score(doc=6667,freq=1.0), product of:
              0.17860822 = queryWeight, product of:
                1.3538085 = boost
                6.0408955 = idf(docFreq=285, maxDocs=44218)
                0.021839509 = queryNorm
              0.37755597 = fieldWeight in 6667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0408955 = idf(docFreq=285, maxDocs=44218)
                0.0625 = fieldNorm(doc=6667)
          0.0470419 = weight(abstract_txt:academic in 6667) [ClassicSimilarity], result of:
            0.0470419 = score(doc=6667,freq=1.0), product of:
              0.16081747 = queryWeight, product of:
                1.5733262 = boost
                4.6802773 = idf(docFreq=1114, maxDocs=44218)
                0.021839509 = queryNorm
              0.29251733 = fieldWeight in 6667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6802773 = idf(docFreq=1114, maxDocs=44218)
                0.0625 = fieldNorm(doc=6667)
          0.2793586 = weight(abstract_txt:maps in 6667) [ClassicSimilarity], result of:
            0.2793586 = score(doc=6667,freq=5.0), product of:
              0.33944878 = queryWeight, product of:
                2.639421 = boost
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.021839509 = queryNorm
              0.8229771 = fieldWeight in 6667, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.0625 = fieldNorm(doc=6667)
        0.28 = coord(7/25)
    
  4. Buchel, O.; Coleman, A.: How can classificatory structures be used to improve science education? (2003) 0.15
    0.15360051 = sum of:
      0.15360051 = product of:
        0.64000213 = sum of:
          0.014243062 = weight(abstract_txt:that in 155) [ClassicSimilarity], result of:
            0.014243062 = score(doc=155,freq=1.0), product of:
              0.054958276 = queryWeight, product of:
                1.0620332 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.021839509 = queryNorm
              0.25916135 = fieldWeight in 155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.109375 = fieldNorm(doc=155)
          0.03044067 = weight(abstract_txt:used in 155) [ClassicSimilarity], result of:
            0.03044067 = score(doc=155,freq=1.0), product of:
              0.08284903 = queryWeight, product of:
                1.1292651 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.021839509 = queryNorm
              0.36742336 = fieldWeight in 155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.109375 = fieldNorm(doc=155)
          0.06944623 = weight(abstract_txt:topic in 155) [ClassicSimilarity], result of:
            0.06944623 = score(doc=155,freq=1.0), product of:
              0.12542574 = queryWeight, product of:
                1.1344879 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.021839509 = queryNorm
              0.553684 = fieldWeight in 155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.109375 = fieldNorm(doc=155)
          0.10297656 = weight(abstract_txt:discipline in 155) [ClassicSimilarity], result of:
            0.10297656 = score(doc=155,freq=1.0), product of:
              0.16309719 = queryWeight, product of:
                1.2936887 = boost
                5.7726316 = idf(docFreq=373, maxDocs=44218)
                0.021839509 = queryNorm
              0.6313816 = fieldWeight in 155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7726316 = idf(docFreq=373, maxDocs=44218)
                0.109375 = fieldNorm(doc=155)
          0.113702364 = weight(abstract_txt:domain in 155) [ClassicSimilarity], result of:
            0.113702364 = score(doc=155,freq=1.0), product of:
              0.21952158 = queryWeight, product of:
                2.1225607 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.021839509 = queryNorm
              0.5179553 = fieldWeight in 155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.109375 = fieldNorm(doc=155)
          0.30919328 = weight(abstract_txt:maps in 155) [ClassicSimilarity], result of:
            0.30919328 = score(doc=155,freq=2.0), product of:
              0.33944878 = queryWeight, product of:
                2.639421 = boost
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.021839509 = queryNorm
              0.91086876 = fieldWeight in 155, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.109375 = fieldNorm(doc=155)
        0.24 = coord(6/25)
    
  5. Kim, J.-M.; Shin, H.; Kim, H.-J.: Schema and constraints-based matching and merging of Topic Maps (2007) 0.13
    0.13352405 = sum of:
      0.13352405 = product of:
        0.55635023 = sum of:
          0.008138892 = weight(abstract_txt:that in 922) [ClassicSimilarity], result of:
            0.008138892 = score(doc=922,freq=1.0), product of:
              0.054958276 = queryWeight, product of:
                1.0620332 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.021839509 = queryNorm
              0.1480922 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=922)
          0.017394667 = weight(abstract_txt:used in 922) [ClassicSimilarity], result of:
            0.017394667 = score(doc=922,freq=1.0), product of:
              0.08284903 = queryWeight, product of:
                1.1292651 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.021839509 = queryNorm
              0.2099562 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=922)
          0.11905068 = weight(abstract_txt:topic in 922) [ClassicSimilarity], result of:
            0.11905068 = score(doc=922,freq=9.0), product of:
              0.12542574 = queryWeight, product of:
                1.1344879 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.021839509 = queryNorm
              0.9491726 = fieldWeight in 922, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=922)
          0.0674346 = weight(abstract_txt:generate in 922) [ClassicSimilarity], result of:
            0.0674346 = score(doc=922,freq=1.0), product of:
              0.17860822 = queryWeight, product of:
                1.3538085 = boost
                6.0408955 = idf(docFreq=285, maxDocs=44218)
                0.021839509 = queryNorm
              0.37755597 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0408955 = idf(docFreq=285, maxDocs=44218)
                0.0625 = fieldNorm(doc=922)
          0.06497278 = weight(abstract_txt:domain in 922) [ClassicSimilarity], result of:
            0.06497278 = score(doc=922,freq=1.0), product of:
              0.21952158 = queryWeight, product of:
                2.1225607 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.021839509 = queryNorm
              0.29597446 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=922)
          0.2793586 = weight(abstract_txt:maps in 922) [ClassicSimilarity], result of:
            0.2793586 = score(doc=922,freq=5.0), product of:
              0.33944878 = queryWeight, product of:
                2.639421 = boost
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.021839509 = queryNorm
              0.8229771 = fieldWeight in 922, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.0625 = fieldNorm(doc=922)
        0.24 = coord(6/25)
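
The effect of the `coord` factor on the ordering of the five hits can be seen by comparing the raw sums with the final scores. The sketch below copies the raw sums and matched-term counts from the listing; the function and variable names are illustrative, not part of any library.

```python
# (doc id, raw sum of term scores, matched query terms) from the listing
hits = [
    (4629, 0.6687789, 9),
    (5252, 0.55055475, 8),
    (6667, 0.5621556, 7),
    (155, 0.64000213, 6),
    (922, 0.55635023, 6),
]
QUERY_TERMS = 25  # denominator of every coord(n/25) line


def final_score(raw_sum, matched, total_terms=QUERY_TERMS):
    # Final score = raw sum scaled by the fraction of query terms matched.
    return raw_sum * matched / total_terms


by_final = sorted(hits, key=lambda h: final_score(h[1], h[2]), reverse=True)
by_raw = sorted(hits, key=lambda h: h[1], reverse=True)

ranking_final = [doc for doc, _, _ in by_final]
ranking_raw = [doc for doc, _, _ in by_raw]
```

Doc 155 has the second-highest raw sum but matches only 6 of 25 query terms, so it falls to fourth place once `coord` is applied, while doc 5252, with the lowest raw sum, moves up to second place because it matches 8 terms.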