Search (3 results, page 1 of 1)

  • × year_i:[2010 TO 2020}
  • × author_ss:"Yi, K."
  1. Yi, K.: ¬A semantic similarity approach to predicting Library of Congress subject headings for social tags (2010) 0.03
    0.03327266 = product of:
      0.06654532 = sum of:
        0.06654532 = product of:
          0.13309065 = sum of:
            0.13309065 = weight(_text_:headings in 3707) [ClassicSimilarity], result of:
              0.13309065 = score(doc=3707,freq=8.0), product of:
                0.24837378 = queryWeight, product of:
                  4.849944 = idf(docFreq=940, maxDocs=44218)
                  0.051211677 = queryNorm
                0.5358482 = fieldWeight in 3707, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  4.849944 = idf(docFreq=940, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3707)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Social tagging or collaborative tagging has become a new trend in the organization, management, and discovery of digital information. The rapid growth of shared information mostly controlled by social tags poses a new challenge for social tag-based information organization and retrieval. A plausible approach for this challenge is linking social tags to a controlled vocabulary. As an introductory step for this approach, this study investigates ways of predicting relevant subject headings for resources from social tags assigned to the resources. The prediction of subject headings was measured by five different similarity measures: tf-idf, cosine-based similarity (CoS), Jaccard similarity (or Jaccard coefficient; JS), Mutual information (MI), and information radius (IRad). Their results were compared to those by professionals. The results show that a CoS measure based on top five social tags was most effective. Inclusions of more social tags only aggravate the performance. The performance of JS is comparable to the performance of CoS while tf-idf is comparable with up to 70% less than the best performance. MI and IRad have inferior performance compared to the other methods. This study demonstrates the application of the similarity measuring techniques to the prediction of correct Library of Congress subject headings.
  2. Yi, K.; Chan, L.M.: Revisiting the syntactical and structural analysis of Library of Congress Subject Headings for the digital environment (2010) 0.03
    0.028232787 = product of:
      0.056465574 = sum of:
        0.056465574 = product of:
          0.11293115 = sum of:
            0.11293115 = weight(_text_:headings in 3431) [ClassicSimilarity], result of:
              0.11293115 = score(doc=3431,freq=4.0), product of:
                0.24837378 = queryWeight, product of:
                  4.849944 = idf(docFreq=940, maxDocs=44218)
                  0.051211677 = queryNorm
                0.45468226 = fieldWeight in 3431, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.849944 = idf(docFreq=940, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3431)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    With the current information environment characterized by the proliferation of digital resources, including collaboratively created and shared resources, Library of Congress Subject Headings (LCSH) is facing the challenges of effective and efficient subject-based organization and retrieval of digital resources. To explore the feasibility of utilizing LCSH in a digital environment, we might need to revisit its basic characteristics. The objectives of our study were to analyze LCSH in both syntactic and relational structures, to discover the structural characteristics of LCSH, and to identify problems and issues for the feasibility of LCSH as an effective subject access tool. This study reports and discusses issues raised by the syntactic and hierarchical structures of LCSH that present challenges to its use in a networked environment. Given the results of this study, we recommend a number of provisional future directions for the development of LCSH towards further becoming a viable system for digital and networked resources.
  3. Yi, K.: Harnessing collective intelligence in social tagging using Delicious (2012) 0.01
    0.008673094 = product of:
      0.017346188 = sum of:
        0.017346188 = product of:
          0.034692377 = sum of:
            0.034692377 = weight(_text_:22 in 515) [ClassicSimilarity], result of:
              0.034692377 = score(doc=515,freq=2.0), product of:
                0.17933457 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.051211677 = queryNorm
                0.19345059 = fieldWeight in 515, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=515)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    25.12.2012 15:22:37