Document (#33203)

Author
Beagle, D.
Title
Visualizing keyword distribution across multidisciplinary c-space
Source
D-Lib magazine. 9(2003) no.6, x S
Year
2003
Abstract
The concept of c-space is proposed as a visualization schema relating containers of content to cataloging surrogates and classification structures. Possible applications of keyword vector clusters within c-space could include improved retrieval rates through the use of captioning within visual hierarchies, tracings of semantic bleeding among subclasses, and access to buried knowledge within subject-neutral publication containers. The Scholastica Project is described as one example, following a tradition of research dating back to the 1980's. Preliminary focus group assessment indicates that this type of classification rendering may offer digital library searchers enriched entry strategies and an expanded range of re-entry vocabularies. Those of us who work in traditional libraries typically assume that our systems of classification: Library of Congress Classification (LCC) and Dewey Decimal Classification (DDC), are descriptive rather than prescriptive. In other words, LCC classes and subclasses approximate natural groupings of texts that reflect an underlying order of knowledge, rather than arbitrary categories prescribed by librarians to facilitate efficient shelving. Philosophical support for this assumption has traditionally been found in a number of places, from the archetypal tree of knowledge, to Aristotelian categories, to the concept of discursive formations proposed by Michel Foucault. Gary P. Radford has elegantly described an encounter with Foucault's discursive formations in the traditional library setting: "Just by looking at the titles on the spines, you can see how the books cluster together...You can identify those books that seem to form the heart of the discursive formation and those books that reside on the margins. Moving along the shelves, you see those books that tend to bleed over into other classifications and that straddle multiple discursive formations. You can physically and sensually experience...those points that feel like state borders or national boundaries, those points where one subject ends and another begins, or those magical places where one subject has morphed into another..."
But what happens to this awareness in a digital library? Can discursive formations be represented in cyberspace, perhaps through diagrams in a visualization interface? And would such a schema be helpful to a digital library user? To approach this question, it is worth taking a moment to reconsider what Radford is looking at. First, he looks at titles to see how the books cluster. To illustrate, I scanned one hundred books on the shelves of a college library under subclass HT 101-395, defined by the LCC subclass caption as Urban groups. The City. Urban sociology. Of the first 100 titles in this sequence, fifty included the word "urban" or variants (e.g. "urbanization"). Another thirty-five used the word "city" or variants. These keywords appear to mark their titles as the heart of this discursive formation. The scattering of titles not using "urban" or "city" used related terms such as "town," "community," or in one case "skyscrapers." So we immediately see some empirical correlation between keywords and classification. But we also see a problem with the commonly used search technique of title-keyword. A student interested in urban studies will want to know about this entire subclass, and may wish to browse every title available therein. A title-keyword search on "urban" will retrieve only half of the titles, while a search on "city" will retrieve just over a third. There will be no overlap, since no titles in this sample contain both words. The only place where both words appear in a common string is in the LCC subclass caption, but captions are not typically indexed in library Online Public Access Catalogs (OPACs). In a traditional library, this problem is mitigated when the student goes to the shelf looking for any one of the books and suddenly discovers a much wider selection than the keyword search had led him to expect. But in a digital library, the issue of non-retrieval can be more problematic, as studies have indicated. Micco and Popp reported that, in a study funded partly by the U.S. Department of Education, 65 of 73 unskilled users searching for material on U.S./Soviet foreign relations found some material but never realized they had missed a large percentage of what was in the database.
Content
Für Scholastica, vgl.: http://beta.belmont.antarcti.ca:8080/start
Footnote
Vgl.: http://dlib.ukoln.ac.uk/dlib/june03/beagle/06beagle.html.
Theme
Visualisierung
Klassifikationssysteme im Online-Retrieval
Object
Scholastica

Similar documents (content)

  1. Howard, S.A.; Knowlton, S.A..: Browsing through bias : the Library of Congress Classification and Subject Headings for African American Studies and LGBTQIA Studies (2018) 0.20
    0.19593272 = sum of:
      0.19593272 = product of:
        0.5442575 = sum of:
          0.08064838 = weight(abstract_txt:shelves in 5519) [ClassicSimilarity], result of:
            0.08064838 = score(doc=5519,freq=1.0), product of:
              0.13155772 = queryWeight, product of:
                1.045224 = boost
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.016040495 = queryNorm
              0.61302656 = fieldWeight in 5519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.078125 = fieldNorm(doc=5519)
          0.06685806 = weight(abstract_txt:looking in 5519) [ClassicSimilarity], result of:
            0.06685806 = score(doc=5519,freq=1.0), product of:
              0.13289814 = queryWeight, product of:
                1.2866377 = boost
                6.439392 = idf(docFreq=191, maxDocs=44218)
                0.016040495 = queryNorm
              0.5030775 = fieldWeight in 5519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.439392 = idf(docFreq=191, maxDocs=44218)
                0.078125 = fieldNorm(doc=5519)
          0.014132268 = weight(abstract_txt:that in 5519) [ClassicSimilarity], result of:
            0.014132268 = score(doc=5519,freq=2.0), product of:
              0.053982712 = queryWeight, product of:
                1.4203154 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016040495 = queryNorm
              0.26179248 = fieldWeight in 5519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=5519)
          0.010554115 = weight(abstract_txt:this in 5519) [ClassicSimilarity], result of:
            0.010554115 = score(doc=5519,freq=1.0), product of:
              0.055984955 = queryWeight, product of:
                1.4464157 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.016040495 = queryNorm
              0.18851699 = fieldWeight in 5519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=5519)
          0.045056798 = weight(abstract_txt:classification in 5519) [ClassicSimilarity], result of:
            0.045056798 = score(doc=5519,freq=2.0), product of:
              0.102154285 = queryWeight, product of:
                1.5952917 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.016040495 = queryNorm
              0.44106615 = fieldWeight in 5519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=5519)
          0.042104855 = weight(abstract_txt:library in 5519) [ClassicSimilarity], result of:
            0.042104855 = score(doc=5519,freq=3.0), product of:
              0.09764226 = queryWeight, product of:
                1.910189 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.016040495 = queryNorm
              0.4312155 = fieldWeight in 5519, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.078125 = fieldNorm(doc=5519)
          0.046850674 = weight(abstract_txt:those in 5519) [ClassicSimilarity], result of:
            0.046850674 = score(doc=5519,freq=1.0), product of:
              0.13906543 = queryWeight, product of:
                2.010457 = boost
                4.312277 = idf(docFreq=1610, maxDocs=44218)
                0.016040495 = queryNorm
              0.33689663 = fieldWeight in 5519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.312277 = idf(docFreq=1610, maxDocs=44218)
                0.078125 = fieldNorm(doc=5519)
          0.08317072 = weight(abstract_txt:books in 5519) [ClassicSimilarity], result of:
            0.08317072 = score(doc=5519,freq=1.0), product of:
              0.20388672 = queryWeight, product of:
                2.4343312 = boost
                5.2214546 = idf(docFreq=648, maxDocs=44218)
                0.016040495 = queryNorm
              0.40792614 = fieldWeight in 5519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2214546 = idf(docFreq=648, maxDocs=44218)
                0.078125 = fieldNorm(doc=5519)
          0.15488163 = weight(abstract_txt:titles in 5519) [ClassicSimilarity], result of:
            0.15488163 = score(doc=5519,freq=2.0), product of:
              0.24494311 = queryWeight, product of:
                2.6681967 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.016040495 = queryNorm
              0.6323167 = fieldWeight in 5519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.078125 = fieldNorm(doc=5519)
        0.36 = coord(9/25)
    
  2. Mowery, R.L.: ¬The classification of African history by the Library of Congress (1983) 0.16
    0.16331893 = sum of:
      0.16331893 = product of:
        0.5832819 = sum of:
          0.023064414 = weight(abstract_txt:will in 319) [ClassicSimilarity], result of:
            0.023064414 = score(doc=319,freq=1.0), product of:
              0.06371427 = queryWeight, product of:
                1.0286901 = boost
                3.8613079 = idf(docFreq=2528, maxDocs=44218)
                0.016040495 = queryNorm
              0.3619976 = fieldWeight in 319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8613079 = idf(docFreq=2528, maxDocs=44218)
                0.09375 = fieldNorm(doc=319)
          0.04711465 = weight(abstract_txt:space in 319) [ClassicSimilarity], result of:
            0.04711465 = score(doc=319,freq=1.0), product of:
              0.09319648 = queryWeight, product of:
                1.0774486 = boost
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.016040495 = queryNorm
              0.5055411 = fieldWeight in 319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.09375 = fieldNorm(doc=319)
          0.0119916275 = weight(abstract_txt:that in 319) [ClassicSimilarity], result of:
            0.0119916275 = score(doc=319,freq=1.0), product of:
              0.053982712 = queryWeight, product of:
                1.4203154 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016040495 = queryNorm
              0.22213829 = fieldWeight in 319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=319)
          0.012664939 = weight(abstract_txt:this in 319) [ClassicSimilarity], result of:
            0.012664939 = score(doc=319,freq=1.0), product of:
              0.055984955 = queryWeight, product of:
                1.4464157 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.016040495 = queryNorm
              0.2262204 = fieldWeight in 319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.09375 = fieldNorm(doc=319)
          0.03823196 = weight(abstract_txt:classification in 319) [ClassicSimilarity], result of:
            0.03823196 = score(doc=319,freq=1.0), product of:
              0.102154285 = queryWeight, product of:
                1.5952917 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.016040495 = queryNorm
              0.37425706 = fieldWeight in 319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.09375 = fieldNorm(doc=319)
          0.0291711 = weight(abstract_txt:library in 319) [ClassicSimilarity], result of:
            0.0291711 = score(doc=319,freq=1.0), product of:
              0.09764226 = queryWeight, product of:
                1.910189 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.016040495 = queryNorm
              0.29875487 = fieldWeight in 319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.09375 = fieldNorm(doc=319)
          0.42104316 = weight(abstract_txt:subclass in 319) [ClassicSimilarity], result of:
            0.42104316 = score(doc=319,freq=2.0), product of:
              0.35060346 = queryWeight, product of:
                2.4130943 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.016040495 = queryNorm
              1.2009099 = fieldWeight in 319, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.09375 = fieldNorm(doc=319)
        0.28 = coord(7/25)
    
  3. Tsay, M.-y.; Shu, Z.-y.: Journal bibliometric analysis : a case study on the Journal of Documentation (2011) 0.13
    0.13116315 = sum of:
      0.13116315 = product of:
        0.54651314 = sum of:
          0.120368645 = weight(abstract_txt:subclasses in 294) [ClassicSimilarity], result of:
            0.120368645 = score(doc=294,freq=2.0), product of:
              0.15824313 = queryWeight, product of:
                1.1463405 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.016040495 = queryNorm
              0.76065636 = fieldWeight in 294, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.0625 = fieldNorm(doc=294)
          0.013846738 = weight(abstract_txt:that in 294) [ClassicSimilarity], result of:
            0.013846738 = score(doc=294,freq=3.0), product of:
              0.053982712 = queryWeight, product of:
                1.4203154 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016040495 = queryNorm
              0.2565032 = fieldWeight in 294, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=294)
          0.011940618 = weight(abstract_txt:this in 294) [ClassicSimilarity], result of:
            0.011940618 = score(doc=294,freq=2.0), product of:
              0.055984955 = queryWeight, product of:
                1.4464157 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.016040495 = queryNorm
              0.21328263 = fieldWeight in 294, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=294)
          0.038894802 = weight(abstract_txt:library in 294) [ClassicSimilarity], result of:
            0.038894802 = score(doc=294,freq=4.0), product of:
              0.09764226 = queryWeight, product of:
                1.910189 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.016040495 = queryNorm
              0.39833984 = fieldWeight in 294, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.0625 = fieldNorm(doc=294)
          0.19848165 = weight(abstract_txt:subclass in 294) [ClassicSimilarity], result of:
            0.19848165 = score(doc=294,freq=1.0), product of:
              0.35060346 = queryWeight, product of:
                2.4130943 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.016040495 = queryNorm
              0.56611437 = fieldWeight in 294, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0625 = fieldNorm(doc=294)
          0.16298068 = weight(abstract_txt:books in 294) [ClassicSimilarity], result of:
            0.16298068 = score(doc=294,freq=6.0), product of:
              0.20388672 = queryWeight, product of:
                2.4343312 = boost
                5.2214546 = idf(docFreq=648, maxDocs=44218)
                0.016040495 = queryNorm
              0.79936874 = fieldWeight in 294, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.2214546 = idf(docFreq=648, maxDocs=44218)
                0.0625 = fieldNorm(doc=294)
        0.24 = coord(6/25)
    
  4. Rafferty, P.: ¬The representation of knowledge in library classification schemes (2001) 0.13
    0.12641577 = sum of:
      0.12641577 = product of:
        0.6320789 = sum of:
          0.008443292 = weight(abstract_txt:this in 640) [ClassicSimilarity], result of:
            0.008443292 = score(doc=640,freq=1.0), product of:
              0.055984955 = queryWeight, product of:
                1.4464157 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.016040495 = queryNorm
              0.1508136 = fieldWeight in 640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=640)
          0.08829293 = weight(abstract_txt:classification in 640) [ClassicSimilarity], result of:
            0.08829293 = score(doc=640,freq=12.0), product of:
              0.102154285 = queryWeight, product of:
                1.5952917 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.016040495 = queryNorm
              0.8643096 = fieldWeight in 640, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=640)
          0.019447401 = weight(abstract_txt:library in 640) [ClassicSimilarity], result of:
            0.019447401 = score(doc=640,freq=1.0), product of:
              0.09764226 = queryWeight, product of:
                1.910189 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.016040495 = queryNorm
              0.19916992 = fieldWeight in 640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.0625 = fieldNorm(doc=640)
          0.23758945 = weight(abstract_txt:formations in 640) [ClassicSimilarity], result of:
            0.23758945 = score(doc=640,freq=1.0), product of:
              0.39526412 = queryWeight, product of:
                2.5621815 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.016040495 = queryNorm
              0.6010904 = fieldWeight in 640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.0625 = fieldNorm(doc=640)
          0.27830583 = weight(abstract_txt:discursive in 640) [ClassicSimilarity], result of:
            0.27830583 = score(doc=640,freq=2.0), product of:
              0.39905974 = queryWeight, product of:
                3.1530495 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.016040495 = queryNorm
              0.6974039 = fieldWeight in 640, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=640)
        0.2 = coord(5/25)
    
  5. Bachir, I.; Buxton, A.: ¬The use of topic sentences for evaluating the representativeness of Arabic article titles (1993) 0.12
    0.1221252 = sum of:
      0.1221252 = product of:
        0.610626 = sum of:
          0.07982786 = weight(abstract_txt:words in 6985) [ClassicSimilarity], result of:
            0.07982786 = score(doc=6985,freq=3.0), product of:
              0.091838494 = queryWeight, product of:
                1.06957 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.016040495 = queryNorm
              0.86922 = fieldWeight in 6985, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.09375 = fieldNorm(doc=6985)
          0.09755511 = weight(abstract_txt:title in 6985) [ClassicSimilarity], result of:
            0.09755511 = score(doc=6985,freq=3.0), product of:
              0.104975626 = queryWeight, product of:
                1.1435128 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.016040495 = queryNorm
              0.929312 = fieldWeight in 6985, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.09375 = fieldNorm(doc=6985)
          0.0291711 = weight(abstract_txt:library in 6985) [ClassicSimilarity], result of:
            0.0291711 = score(doc=6985,freq=1.0), product of:
              0.09764226 = queryWeight, product of:
                1.910189 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.016040495 = queryNorm
              0.29875487 = fieldWeight in 6985, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.09375 = fieldNorm(doc=6985)
          0.1102047 = weight(abstract_txt:keyword in 6985) [ClassicSimilarity], result of:
            0.1102047 = score(doc=6985,freq=1.0), product of:
              0.19470564 = queryWeight, product of:
                2.0105295 = boost
                6.037405 = idf(docFreq=286, maxDocs=44218)
                0.016040495 = queryNorm
              0.5660067 = fieldWeight in 6985, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.037405 = idf(docFreq=286, maxDocs=44218)
                0.09375 = fieldNorm(doc=6985)
          0.2938672 = weight(abstract_txt:titles in 6985) [ClassicSimilarity], result of:
            0.2938672 = score(doc=6985,freq=5.0), product of:
              0.24494311 = queryWeight, product of:
                2.6681967 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.016040495 = queryNorm
              1.1997366 = fieldWeight in 6985, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.09375 = fieldNorm(doc=6985)
        0.2 = coord(5/25)