Search (8 results, page 1 of 1)

  • × author_ss:"Yi, K."
  1. Yi, K.; Chan, L.M.: Revisiting the syntactical and structural analysis of Library of Congress Subject Headings for the digital environment (2010) 0.02
    0.019294888 = product of:
      0.038589776 = sum of:
        0.038589776 = product of:
          0.07717955 = sum of:
            0.07717955 = weight(_text_:subject in 3431) [ClassicSimilarity], result of:
              0.07717955 = score(doc=3431,freq=8.0), product of:
                0.16275941 = queryWeight, product of:
                  3.576596 = idf(docFreq=3361, maxDocs=44218)
                  0.04550679 = queryNorm
                0.4741941 = fieldWeight in 3431, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  3.576596 = idf(docFreq=3361, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3431)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    With the current information environment characterized by the proliferation of digital resources, including collaboratively created and shared resources, Library of Congress Subject Headings (LCSH) is facing the challenges of effective and efficient subject-based organization and retrieval of digital resources. To explore the feasibility of utilizing LCSH in a digital environment, we might need to revisit its basic characteristics. The objectives of our study were to analyze LCSH in both syntactic and relational structures, to discover the structural characteristics of LCSH, and to identify problems and issues for the feasibility of LCSH as an effective subject access tool. This study reports and discusses issues raised by the syntactic and hierarchical structures of LCSH that present challenges to its use in a networked environment. Given the results of this study, we recommend a number of provisional future directions for the development of LCSH towards further becoming a viable system for digital and networked resources.
  2. Yi, K.: ¬A semantic similarity approach to predicting Library of Congress subject headings for social tags (2010) 0.02
    0.016079074 = product of:
      0.032158148 = sum of:
        0.032158148 = product of:
          0.064316295 = sum of:
            0.064316295 = weight(_text_:subject in 3707) [ClassicSimilarity], result of:
              0.064316295 = score(doc=3707,freq=8.0), product of:
                0.16275941 = queryWeight, product of:
                  3.576596 = idf(docFreq=3361, maxDocs=44218)
                  0.04550679 = queryNorm
                0.39516178 = fieldWeight in 3707, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  3.576596 = idf(docFreq=3361, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3707)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Social tagging or collaborative tagging has become a new trend in the organization, management, and discovery of digital information. The rapid growth of shared information mostly controlled by social tags poses a new challenge for social tag-based information organization and retrieval. A plausible approach for this challenge is linking social tags to a controlled vocabulary. As an introductory step for this approach, this study investigates ways of predicting relevant subject headings for resources from social tags assigned to the resources. The prediction of subject headings was measured by five different similarity measures: tf-idf, cosine-based similarity (CoS), Jaccard similarity (or Jaccard coefficient; JS), Mutual information (MI), and information radius (IRad). Their results were compared to those by professionals. The results show that a CoS measure based on top five social tags was most effective. Inclusions of more social tags only aggravate the performance. The performance of JS is comparable to the performance of CoS while tf-idf is comparable with up to 70% less than the best performance. MI and IRad have inferior performance compared to the other methods. This study demonstrates the application of the similarity measuring techniques to the prediction of correct Library of Congress subject headings.
  3. Yi, K.; Chan, L.M.: ¬A visualization software tool for Library of Congress Subject Headings (2008) 0.01
    0.013643546 = product of:
      0.027287092 = sum of:
        0.027287092 = product of:
          0.054574184 = sum of:
            0.054574184 = weight(_text_:subject in 2503) [ClassicSimilarity], result of:
              0.054574184 = score(doc=2503,freq=4.0), product of:
                0.16275941 = queryWeight, product of:
                  3.576596 = idf(docFreq=3361, maxDocs=44218)
                  0.04550679 = queryNorm
                0.33530587 = fieldWeight in 2503, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.576596 = idf(docFreq=3361, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2503)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Content
    The aim of this study is to develop a software tool, VisuaLCSH, for effective searching, browsing, and maintenance of LCSH. This tool enables visualizing subject headings and hierarchical structures implied and embedded in LCSH. A conceptual framework for converting the hierarchical structure of headings in LCSH to an explicit tree structure is proposed, described, and implemented. The highlights of VisuaLCSH are summarized below: 1) revealing multiple aspects of a heading; 2) normalizing the hierarchical relationships in LCSH; 3) showing multi-level hierarchies in LCSH sub-trees; 4) improving the navigational function of LCSH in retrieval; and 5) enabling the implementation of generic search, i.e., the 'exploding' feature, in searching LCSH.
  4. Yi, K.; Chan, L.M.: Linking folksonomy to Library of Congress subject headings : an exploratory study (2009) 0.01
    0.01113991 = product of:
      0.02227982 = sum of:
        0.02227982 = product of:
          0.04455964 = sum of:
            0.04455964 = weight(_text_:subject in 3616) [ClassicSimilarity], result of:
              0.04455964 = score(doc=3616,freq=6.0), product of:
                0.16275941 = queryWeight, product of:
                  3.576596 = idf(docFreq=3361, maxDocs=44218)
                  0.04550679 = queryNorm
                0.2737761 = fieldWeight in 3616, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  3.576596 = idf(docFreq=3361, maxDocs=44218)
                  0.03125 = fieldNorm(doc=3616)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Purpose - The purpose of this paper is to investigate the linking of a folksonomy (user vocabulary) and LCSH (controlled vocabulary) on the basis of word matching, for the potential use of LCSH in bringing order to folksonomies. Design/methodology/approach - A selected sample of a folksonomy from a popular collaborative tagging system, Delicious, was word-matched with LCSH. LCSH was transformed into a tree structure called an LCSH tree for the matching. A close examination was conducted on the characteristics of folksonomies, the overlap of folksonomies with LCSH, and the distribution of folksonomies over the LCSH tree. Findings - The experimental results showed that the total proportion of tags being matched with LC subject headings constituted approximately two-thirds of all tags involved, with an additional 10 percent of the remaining tags having potential matches. A number of barriers for the linking as well as two areas in need of improving the matching are identified and described. Three important tag distribution patterns over the LCSH tree were identified and supported: skewedness, multifacet, and Zipfian-pattern. Research limitations/implications - The results of the study can be adopted for the development of innovative methods of mapping between folksonomy and LCSH, which directly contributes to effective access and retrieval of tagged web resources and to the integration of multiple information repositories based on the two vocabularies. Practical implications - The linking of controlled vocabularies can be applicable to enhance information retrieval capability within collaborative tagging systems as well as across various tagging system information depositories and bibliographic databases. Originality/value - This is among frontier works that examines the potential of linking a folksonomy, extracted from a collaborative tagging system, to an authority-maintained subject heading system. It provides exploratory data to support further advanced mapping methods for linking the two vocabularies.
  5. Yi, K.: Automatic text classification using library classification schemes : trends, issues and challenges (2007) 0.01
    0.010789698 = product of:
      0.021579396 = sum of:
        0.021579396 = product of:
          0.043158792 = sum of:
            0.043158792 = weight(_text_:22 in 2560) [ClassicSimilarity], result of:
              0.043158792 = score(doc=2560,freq=2.0), product of:
                0.15935703 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04550679 = queryNorm
                0.2708308 = fieldWeight in 2560, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2560)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 9.2008 18:31:54
  6. Yi, K.; Beheshti, J.; Cole, C.; Leide, J.E.; Large, A.: User search behavior of domain-specific information retrieval systems : an analysis of the query logs from PsycINFO and ABC-Clio's Historical Abstracts/America: History and Life (2006) 0.01
    0.008039537 = product of:
      0.016079074 = sum of:
        0.016079074 = product of:
          0.032158148 = sum of:
            0.032158148 = weight(_text_:subject in 197) [ClassicSimilarity], result of:
              0.032158148 = score(doc=197,freq=2.0), product of:
                0.16275941 = queryWeight, product of:
                  3.576596 = idf(docFreq=3361, maxDocs=44218)
                  0.04550679 = queryNorm
                0.19758089 = fieldWeight in 197, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.576596 = idf(docFreq=3361, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=197)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    The authors report the findings of a study that analyzes and compares the query logs of PsycINFO for psychology and the two history databases of ABC-Clio: Historical Abstracts and America: History and Life to establish the sociological nature of information need, searching, and seeking in history versus psychology. Two problems are addressed: (a) What level of query log analysis - by individual query terms, by co-occurrence of word pairs, or by multiword terms (MWTs) - best serves as data for categorizing the queries to these two subject-bound databases; and (b) how can the differences in the nature of the queries to history versus psychology databases aid in our understanding of user search behavior and the information needs of their respective users. The authors conclude that MWTs provide the most effective snapshot of user searching behavior for query categorization. The MWTs to ABC-Clio indicate specific instances of historical events, people, and regions, whereas the MWTs to PsycINFO indicate concepts roughly equivalent to descriptors used by PsycINFO's own classification scheme. The average length of queries is 3.16 terms for PsycINFO and 3.42 for ABC-Clio, which breaks from findings for other reference and scholarly search engine studies, bringing query length closer in line to findings for general Web search engines like Excite.
  7. Yi, K.; Choi, N.; Kim, Y.S.: ¬A content analysis of Twitter hyperlinks and their application in web resource indexing (2016) 0.01
    0.008039537 = product of:
      0.016079074 = sum of:
        0.016079074 = product of:
          0.032158148 = sum of:
            0.032158148 = weight(_text_:subject in 3075) [ClassicSimilarity], result of:
              0.032158148 = score(doc=3075,freq=2.0), product of:
                0.16275941 = queryWeight, product of:
                  3.576596 = idf(docFreq=3361, maxDocs=44218)
                  0.04550679 = queryNorm
                0.19758089 = fieldWeight in 3075, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.576596 = idf(docFreq=3361, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3075)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Twitter has emerged as a popular source of sharing and delivering news information. In tweet messages, URLs to web resources and hashtags are often included. This study investigates the potential of the hyperlinks and hashtags as topical clues and indicators to tweet messages. For this study, we crawled and analyzed about 1.5 million tweets for a 3-month period covering any topic or subject. The findings of this study revealed a power law relationship for the ranking and frequency of (a) the host names of URLs, and (b) a pair of hashtags and URLs that appeared in the tweet messages. This study also discovered that the most popular URLs used in tweets come from news and media websites, and a majority of the hyperlinked resources are news web pages. One implication of this study is that Twitter users are becoming more active in sharing already published information than producing new information. Finally, our investigation on hashtags for web resource indexing reveals that hashtags have the potential to be used as indexing terms for co-occurring URLs in the same tweet. We also discuss the implications of this study for web resource recommendation.
  8. Yi, K.: Harnessing collective intelligence in social tagging using Delicious (2012) 0.01
    0.0077069276 = product of:
      0.015413855 = sum of:
        0.015413855 = product of:
          0.03082771 = sum of:
            0.03082771 = weight(_text_:22 in 515) [ClassicSimilarity], result of:
              0.03082771 = score(doc=515,freq=2.0), product of:
                0.15935703 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04550679 = queryNorm
                0.19345059 = fieldWeight in 515, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=515)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    25.12.2012 15:22:37