Search (11 results, page 1 of 1)

  • × author_ss:"Zhang, J."
  1. Zhang, J.: TOFIR: A tool of facilitating information retrieval : introduce a visual retrieval model (2001) 0.01
    0.012186137 = product of:
      0.024372274 = sum of:
        0.024372274 = product of:
          0.17060591 = sum of:
            0.17060591 = weight(_text_:visual in 7711) [ClassicSimilarity], result of:
              0.17060591 = score(doc=7711,freq=2.0), product of:
                0.2084343 = queryWeight, product of:
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.039389215 = queryNorm
                0.8185117 = fieldWeight in 7711, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.109375 = fieldNorm(doc=7711)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
  2. Zhang, J.; Nguyen, T.: WebStar: a visualization model for hyperlink structures (2005) 0.01
    0.0116781555 = product of:
      0.023356311 = sum of:
        0.023356311 = product of:
          0.16349417 = sum of:
            0.16349417 = weight(_text_:visual in 1056) [ClassicSimilarity], result of:
              0.16349417 = score(doc=1056,freq=10.0), product of:
                0.2084343 = queryWeight, product of:
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.039389215 = queryNorm
                0.7843919 = fieldWeight in 1056, product of:
                  3.1622777 = tf(freq=10.0), with freq of:
                    10.0 = termFreq=10.0
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1056)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
    Abstract
    The authors introduce an information visualization model, WebStar, for hyperlink-based information systems. Hyperlinks within a hyperlink-based document can be visualized in a two-dimensional visual space. All links are projected within a display sphere in the visual space. The relationship between a specified central document and its hyperlinked documents is visually presented in the visual space. In addition, users are able to define a group of subjects and to observe relevance between each subject and all hyperlinked documents via movement of that subject around the display sphere center. WebStar allows users to dynamically change an interest center during navigation. A retrieval mechanism is developed to control retrieved results in the visual space. Impact of movement of a subject on the visual document distribution is analyzed. An ambiguity problem caused by projection is discussed. Potential applications of this visualization model in information retrieval are included. Future research directions on the topic are addressed.
  3. Zhang, J.; Korfhage, R.R.: DARE: Distance and Angle Retrieval Environment : A tale of the two measures (1999) 0.01
    0.009847886 = product of:
      0.019695772 = sum of:
        0.019695772 = product of:
          0.1378704 = sum of:
            0.1378704 = weight(_text_:visual in 3916) [ClassicSimilarity], result of:
              0.1378704 = score(doc=3916,freq=4.0), product of:
                0.2084343 = queryWeight, product of:
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.039389215 = queryNorm
                0.66145736 = fieldWeight in 3916, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3916)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
    Abstract
    This article presents a visualization tool for information retrieval. Some retrieval evaluation models are interpreted in the two-dimensional space comprising direction and distance. The two different similarity measures-angle and distance-are displayed in the visual space. A new retrieval means based on the visual retrieval tool, the controlling bar, is developed for a search
  4. Zhang, J.; Wolfram, D.; Wang, P.; Hong, Y.; Gillis, R.: Visualization of health-subject analysis based on query term co-occurrences (2008) 0.01
    0.0075382185 = product of:
      0.015076437 = sum of:
        0.015076437 = product of:
          0.10553505 = sum of:
            0.10553505 = weight(_text_:visual in 2376) [ClassicSimilarity], result of:
              0.10553505 = score(doc=2376,freq=6.0), product of:
                0.2084343 = queryWeight, product of:
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.039389215 = queryNorm
                0.50632286 = fieldWeight in 2376, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2376)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
    Abstract
    A multidimensional-scaling approach is used to analyze frequently used medical-topic terms in queries submitted to a Web-based consumer health information system. Based on a year-long transaction log file, five medical focus keywords (stomach, hip, stroke, depression, and cholesterol) and their co-occurring query terms are analyzed. An overlap-coefficient similarity measure and a conversion measure are used to calculate the proximity of terms to one another based on their co-occurrences in queries. The impact of the dimensionality of the visual configuration, the cutoff point of term co-occurrence for inclusion in the analysis, and the Minkowski metric power k on the stress value are discussed. A visual clustering of groups of terms based on the proximity within each focus-keyword group is also conducted. Term distributions within each visual configuration are characterized and are compared with formal medical vocabulary. This investigation reveals that there are significant differences between consumer health query-term usage and more formal medical terminology used by medical professionals when describing the same medical subject. Future directions are discussed.
  5. Zhang, J.; An, L.; Tang, T.; Hong, Y.: Visual health subject directory analysis based on users' traversal activities (2009) 0.01
    0.0073859147 = product of:
      0.014771829 = sum of:
        0.014771829 = product of:
          0.1034028 = sum of:
            0.1034028 = weight(_text_:visual in 3112) [ClassicSimilarity], result of:
              0.1034028 = score(doc=3112,freq=4.0), product of:
                0.2084343 = queryWeight, product of:
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.039389215 = queryNorm
                0.49609303 = fieldWeight in 3112, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3112)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
    Abstract
    Concerns about health issues cover a wide spectrum. Consumer health information, which has become more available on the Internet, plays an extremely important role in addressing these concerns. A subject directory as an information organization and browsing mechanism is widely used in consumer health-related Websites. In this study we employed the information visualization technique Self-Organizing Map (SOM) in combination with a new U-matrix algorithm to analyze health subject clusters through a Web transaction log. An experimental study was conducted to test the proposed methods. The findings show that the clusters identified from the same cells based on path-length-1 outperformed both the clusters from the adjacent cells based on path-length-1 and the clusters from the same cells based on path-length-2 in the visual SOM display. The U-matrix method successfully distinguished the irrelevant subjects situated in the adjacent cells with different colors in the SOM display. The findings of this study lead to a better understanding of the health-related subject relationship from the users' traversal perspective.
  6. Zhang, J.; Zeng, M.L.: ¬A new similarity measure for subject hierarchical structures (2014) 0.01
    0.006670869 = product of:
      0.013341738 = sum of:
        0.013341738 = product of:
          0.026683476 = sum of:
            0.026683476 = weight(_text_:22 in 1778) [ClassicSimilarity], result of:
              0.026683476 = score(doc=1778,freq=2.0), product of:
                0.13793433 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.039389215 = queryNorm
                0.19345059 = fieldWeight in 1778, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1778)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    8. 4.2015 16:22:13
  7. Zhang, J.; Wolfram, D.; Wang, P.: Analysis of query keywords of sports-related queries using visualization and clustering (2009) 0.01
    0.006154929 = product of:
      0.012309858 = sum of:
        0.012309858 = product of:
          0.086169 = sum of:
            0.086169 = weight(_text_:visual in 2947) [ClassicSimilarity], result of:
              0.086169 = score(doc=2947,freq=4.0), product of:
                0.2084343 = queryWeight, product of:
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.039389215 = queryNorm
                0.41341084 = fieldWeight in 2947, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2947)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
    Abstract
    The authors investigated 11 sports-related query keywords extracted from a public search engine query log to better understand sports-related information seeking on the Internet. After the query log contents were cleaned and query data were parsed, popular sports-related keywords were identified, along with frequently co-occurring query terms associated with the identified keywords. Relationships among each sports-related focus keyword and its related keywords were characterized and grouped using multidimensional scaling (MDS) in combination with traditional hierarchical clustering methods. The two approaches were synthesized in a visual context by highlighting the results of the hierarchical clustering analysis in the visual MDS configuration. Important events, people, subjects, merchandise, and so on related to a sport were illustrated, and relationships among the sports were analyzed. A small-scale comparative study of sports searches with and without term assistance was conducted. Searches that used search term assistance by relying on previous query term relationships outperformed the searches without the search term assistance. The findings of this study provide insights into sports information seeking behavior on the Internet. The developed method also may be applied to other query log subject areas.
  8. An, L.; Zhang, J.; Yu, C.: ¬The visual subject analysis of library and information science journals with self-organizing map (2011) 0.00
    0.004352192 = product of:
      0.008704384 = sum of:
        0.008704384 = product of:
          0.060930684 = sum of:
            0.060930684 = weight(_text_:visual in 4613) [ClassicSimilarity], result of:
              0.060930684 = score(doc=4613,freq=2.0), product of:
                0.2084343 = queryWeight, product of:
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.039389215 = queryNorm
                0.29232562 = fieldWeight in 4613, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4613)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
  9. Zhang, J.; Zhao, Y.: ¬A user term visualization analysis based on a social question and answer log (2013) 0.00
    0.004352192 = product of:
      0.008704384 = sum of:
        0.008704384 = product of:
          0.060930684 = sum of:
            0.060930684 = weight(_text_:visual in 2715) [ClassicSimilarity], result of:
              0.060930684 = score(doc=2715,freq=2.0), product of:
                0.2084343 = queryWeight, product of:
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.039389215 = queryNorm
                0.29232562 = fieldWeight in 2715, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2715)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
    Abstract
    The authors of this paper investigate terms of consumers' diabetes based on a log from the Yahoo!Answers social question and answers (Q&A) forum, ascertain characteristics and relationships among terms related to diabetes from the consumers' perspective, and reveal users' diabetes information seeking patterns. In this study, the log analysis method, data coding method, and visualization multiple-dimensional scaling analysis method were used for analysis. The visual analyses were conducted at two levels: terms analysis within a category and category analysis among the categories in the schema. The findings show that the average number of words per question was 128.63, the average number of sentences per question was 8.23, the average number of words per response was 254.83, and the average number of sentences per response was 16.01. There were 12 categories (Cause & Pathophysiology, Sign & Symptom, Diagnosis & Test, Organ & Body Part, Complication & Related Disease, Medication, Treatment, Education & Info Resource, Affect, Social & Culture, Lifestyle, and Nutrient) in the diabetes related schema which emerged from the data coding analysis. The analyses at the two levels show that terms and categories were clustered and patterns were revealed. Future research directions are also included.
  10. Zhang, J.; Zhai, S.; Stevenson, J.A.; Xia, L.: Optimization of the subject directory in a government agriculture department web portal (2016) 0.00
    0.004352192 = product of:
      0.008704384 = sum of:
        0.008704384 = product of:
          0.060930684 = sum of:
            0.060930684 = weight(_text_:visual in 3088) [ClassicSimilarity], result of:
              0.060930684 = score(doc=3088,freq=2.0), product of:
                0.2084343 = queryWeight, product of:
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.039389215 = queryNorm
                0.29232562 = fieldWeight in 3088, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3088)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
    Abstract
    We investigated a subject directory in the US Agriculture Department-Economic Research Service portal. Parent-child relationships, related connections among the categories, and related connections among the subcategories in the subject directory were optimized using social network analysis. The optimization results were assessed by both density analysis and edge strength analysis methods. In addition, the results were evaluated by domain experts. From this study, it is recommended that four subcategories be switched from their original four categories into two different categories as a result of the parent-child relationship optimization.?It is also recommended that 132 subcategories be moved to 40 subcategories and that eight categories be moved to two categories as a result of the related connection optimization. The findings show that optimization boosted the densities of the optimized categories, and the recommended connections of both the related categories and subcategories were stronger than the existing connections of the related categories and subcategories. This paper provides visual displays of the optimization analysis as well as suggestions to enhance the subject directory of this portal.
  11. Zhang, J.; Mostafa, J.; Tripathy, H.: Information retrieval by semantic analysis and visualization of the concept space of D-Lib® magazine (2002) 0.00
    0.0030774644 = product of:
      0.006154929 = sum of:
        0.006154929 = product of:
          0.0430845 = sum of:
            0.0430845 = weight(_text_:visual in 1211) [ClassicSimilarity], result of:
              0.0430845 = score(doc=1211,freq=4.0), product of:
                0.2084343 = queryWeight, product of:
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.039389215 = queryNorm
                0.20670542 = fieldWeight in 1211, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  5.291659 = idf(docFreq=604, maxDocs=44218)
                  0.01953125 = fieldNorm(doc=1211)
          0.14285715 = coord(1/7)
      0.5 = coord(1/2)
    
    Abstract
    In this article we present a method for retrieving documents from a digital library through a visual interface based on automatically generated concepts. We used a vocabulary generation algorithm to generate a set of concepts for the digital library and a technique called the max-min distance technique to cluster them. Additionally, the concepts were visualized in a spring embedding graph layout to depict the semantic relationship among them. The resulting graph layout serves as an aid to users for retrieving documents. An online archive containing the contents of D-Lib Magazine from July 1995 to May 2002 was used to test the utility of an implemented retrieval and visualization system. We believe that the method developed and tested can be applied to many different domains to help users get a better understanding of online document collections and to minimize users' cognitive load during execution of search tasks. Over the past few years, the volume of information available through the World Wide Web has been expanding exponentially. Never has so much information been so readily available and shared among so many people. Unfortunately, the unstructured nature and huge volume of information accessible over networks have made it hard for users to sift through and find relevant information. To deal with this problem, information retrieval (IR) techniques have gained more intensive attention from both industrial and academic researchers. Numerous IR techniques have been developed to help deal with the information overload problem. These techniques concentrate on mathematical models and algorithms for retrieval. Popular IR models such as the Boolean model, the vector-space model, the probabilistic model and their variants are well established.
    Nevertheless, because thesaurus use has shown to improve retrieval, for our method we integrate functions in the search interface that permit users to explore built-in search vocabularies to improve retrieval from digital libraries. Our method automatically generates the terms and their semantic relationships representing relevant topics covered in a digital library. We call these generated terms the "concepts", and the generated terms and their semantic relationships we call the "concept space". Additionally, we used a visualization technique to display the concept space and allow users to interact with this space. The automatically generated term set is considered to be more representative of subject area in a corpus than an "externally" imposed thesaurus, and our method has the potential of saving a significant amount of time and labor for those who have been manually creating thesauri as well. Information visualization is an emerging discipline and developed very quickly in the last decade. With growing volumes of documents and associated complexities, information visualization has become increasingly important. Researchers have found information visualization to be an effective way to use and understand information while minimizing a user's cognitive load. Our work was based on an algorithmic approach of concept discovery and association. Concepts are discovered using an algorithm based on an automated thesaurus generation procedure. Subsequently, similarities among terms are computed using the cosine measure, and the associations among terms are established using a method known as max-min distance clustering. The concept space is then visualized in a spring embedding graph, which roughly shows the semantic relationships among concepts in a 2-D visual representation. The semantic space of the visualization is used as a medium for users to retrieve the desired documents. In the remainder of this article, we present our algorithmic approach of concept generation and clustering, followed by description of the visualization technique and interactive interface. The paper ends with key conclusions and discussions on future work.