Search (36 results, page 1 of 2)

Zhang, J.; Zeng, M.L.: ¬A new similarity measure for subject hierarchical structures (2014) 0.02
```
0.02155178 = product of:
  0.04310356 = sum of:
    0.04310356 = sum of:
      0.008938551 = weight(_text_:a in 1778) [ClassicSimilarity], result of:
        0.008938551 = score(doc=1778,freq=14.0), product of:
          0.053039093 = queryWeight, product of:
            1.153047 = idf(docFreq=37942, maxDocs=44218)
            0.045999072 = queryNorm
          0.1685276 = fieldWeight in 1778, product of:
            3.7416575 = tf(freq=14.0), with freq of:
              14.0 = termFreq=14.0
            1.153047 = idf(docFreq=37942, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1778)
      0.003003814 = weight(_text_:s in 1778) [ClassicSimilarity], result of:
        0.003003814 = score(doc=1778,freq=2.0), product of:
          0.05001192 = queryWeight, product of:
            1.0872376 = idf(docFreq=40523, maxDocs=44218)
            0.045999072 = queryNorm
          0.060061958 = fieldWeight in 1778, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.0872376 = idf(docFreq=40523, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1778)
      0.031161197 = weight(_text_:22 in 1778) [ClassicSimilarity], result of:
        0.031161197 = score(doc=1778,freq=2.0), product of:
          0.16108091 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.045999072 = queryNorm
          0.19345059 = fieldWeight in 1778, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1778)
  0.5 = coord(1/2)
```
Abstract

Purpose - The purpose of this paper is to introduce a new similarity method to gauge the differences between two subject hierarchical structures. Design/methodology/approach - In the proposed similarity measure, nodes on two hierarchical structures are projected onto a two-dimensional space, respectively, and both structural similarity and subject similarity of nodes are considered in the similarity between the two hierarchical structures. The extent to which the structural similarity impacts on the similarity can be controlled by adjusting a parameter. An experiment was conducted to evaluate soundness of the measure. Eight experts whose research interests were information retrieval and information organization participated in the study. Results from the new measure were compared with results from the experts. Findings - The evaluation shows strong correlations between the results from the new method and the results from the experts. It suggests that the similarity method achieved satisfactory results. Practical implications - Hierarchical structures that are found in subject directories, taxonomies, classification systems, and other classificatory structures play an extremely important role in information organization and information representation. Measuring the similarity between two subject hierarchical structures allows an accurate overarching understanding of the degree to which the two hierarchical structures are similar. Originality/value - Both structural similarity and subject similarity of nodes were considered in the proposed similarity method, and the extent to which the structural similarity impacts on the similarity can be adjusted. In addition, a new evaluation method for a hierarchical structure similarity was presented.

Date

8. 4.2015 16:22:13

Source

Journal of documentation. 70(2014) no.3, S.364-391

Type

a

Zhang, J.: TOFIR: A tool of facilitating information retrieval : introduce a visual retrieval model (2001) 0.01

0.008265105 = product of:
  0.01653021 = sum of:
    0.01653021 = product of:
      0.024795314 = sum of:
        0.016384635 = weight(_text_:a in 7711) [ClassicSimilarity], result of:
          0.016384635 = score(doc=7711,freq=6.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.3089162 = fieldWeight in 7711, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.109375 = fieldNorm(doc=7711)
        0.008410679 = weight(_text_:s in 7711) [ClassicSimilarity], result of:
          0.008410679 = score(doc=7711,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.16817348 = fieldWeight in 7711, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.109375 = fieldNorm(doc=7711)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: Information processing and management. 37(2001) no.4, S.639-657
Type: a

Patrick, J.; Zhang, J.; Artola-Zubillaga, X.: ¬An architecture and query language for a federation of heterogeneous dictionary databases (2000) 0.01

0.007262893 = product of:
  0.014525786 = sum of:
    0.014525786 = product of:
      0.021788679 = sum of:
        0.013377999 = weight(_text_:a in 339) [ClassicSimilarity], result of:
          0.013377999 = score(doc=339,freq=4.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.25222903 = fieldWeight in 339, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.109375 = fieldNorm(doc=339)
        0.008410679 = weight(_text_:s in 339) [ClassicSimilarity], result of:
          0.008410679 = score(doc=339,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.16817348 = fieldWeight in 339, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.109375 = fieldNorm(doc=339)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: Computers and the humanities. 35(2000), S.393-407
Type: a

Zhang, J.; Dimitroff, A.: Internet search engines' response to Metadata Dublin Core implementation (2005) 0.01

0.007262893 = product of:
  0.014525786 = sum of:
    0.014525786 = product of:
      0.021788679 = sum of:
        0.013377999 = weight(_text_:a in 4652) [ClassicSimilarity], result of:
          0.013377999 = score(doc=4652,freq=4.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.25222903 = fieldWeight in 4652, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.109375 = fieldNorm(doc=4652)
        0.008410679 = weight(_text_:s in 4652) [ClassicSimilarity], result of:
          0.008410679 = score(doc=4652,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.16817348 = fieldWeight in 4652, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.109375 = fieldNorm(doc=4652)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: Journal of information science. 30(2005) no.4, S.310-
Type: a

Zhang, J.: ¬A representational analysis of relational information displays (1996) 0.01

0.006369262 = product of:
  0.012738524 = sum of:
    0.012738524 = product of:
      0.019107785 = sum of:
        0.014301682 = weight(_text_:a in 6403) [ClassicSimilarity], result of:
          0.014301682 = score(doc=6403,freq=14.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.26964417 = fieldWeight in 6403, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=6403)
        0.0048061023 = weight(_text_:s in 6403) [ClassicSimilarity], result of:
          0.0048061023 = score(doc=6403,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.09609913 = fieldWeight in 6403, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.0625 = fieldNorm(doc=6403)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Abstract: Analyses graphic and tabular displays under a common, unified form - relational information displays (RIDs) which are displays that represent relations between dimensions. A representational taxonomy is developed that classifies all RIDs and serves as a framework for systematic studies of RIDs. Develops a taxonomy of RIDs which can classifiy the majority of dimension based display tasks and analyzes the relation between representations of displays and structures of tasks in terms of a mapping principle
Source: International journal of human-computer studies. 45(1996) no.1, S.59-74
Type: a

Zhang, J.; Korfhage, R.R.: DARE: Distance and Angle Retrieval Environment : A tale of the two measures (1999) 0.01

0.005631077 = product of:
  0.011262154 = sum of:
    0.011262154 = product of:
      0.01689323 = sum of:
        0.012087128 = weight(_text_:a in 3916) [ClassicSimilarity], result of:
          0.012087128 = score(doc=3916,freq=10.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.22789092 = fieldWeight in 3916, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=3916)
        0.0048061023 = weight(_text_:s in 3916) [ClassicSimilarity], result of:
          0.0048061023 = score(doc=3916,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.09609913 = fieldWeight in 3916, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.0625 = fieldNorm(doc=3916)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Abstract: This article presents a visualization tool for information retrieval. Some retrieval evaluation models are interpreted in the two-dimensional space comprising direction and distance. The two different similarity measures-angle and distance-are displayed in the visual space. A new retrieval means based on the visual retrieval tool, the controlling bar, is developed for a search
Source: Journal of the American Society for Information Science. 50(1999) no.9, S.779-787
Type: a

Zhang, J.; Dimitroff, A.: ¬The impact of webpage content characteristics on webpage visibility in search engine results : part I (2005) 0.01

0.005631077 = product of:
  0.011262154 = sum of:
    0.011262154 = product of:
      0.01689323 = sum of:
        0.012087128 = weight(_text_:a in 1032) [ClassicSimilarity], result of:
          0.012087128 = score(doc=1032,freq=10.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.22789092 = fieldWeight in 1032, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=1032)
        0.0048061023 = weight(_text_:s in 1032) [ClassicSimilarity], result of:
          0.0048061023 = score(doc=1032,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.09609913 = fieldWeight in 1032, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.0625 = fieldNorm(doc=1032)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Abstract: Content characteristics of a webpage include factors such as keyword position in a webpage, keyword duplication, layout, and their combination. These factors may impact webpage visibility in a search engine. Four hypotheses are presented relating to the impact of selected content characteristics on webpage visibility in search engine results lists. Webpage visibility can be improved by increasing the frequency of keywords in the title, in the full-text and in both the title and full-text.
Source: Information processing and management. 41(2005) no.3, S.665-690
Type: a

Zhang, J.; Dimitroff, A.: ¬The impact of metadata implementation on webpage visibility in search engine results : part II (2005) 0.01

0.005573104 = product of:
  0.011146208 = sum of:
    0.011146208 = product of:
      0.016719311 = sum of:
        0.012513972 = weight(_text_:a in 1027) [ClassicSimilarity], result of:
          0.012513972 = score(doc=1027,freq=14.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.23593865 = fieldWeight in 1027, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1027)
        0.0042053396 = weight(_text_:s in 1027) [ClassicSimilarity], result of:
          0.0042053396 = score(doc=1027,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.08408674 = fieldWeight in 1027, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1027)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Abstract: This paper discusses the impact of metadata implementation in a webpage on its visibility performance in a search engine results list. Influential internal and external factors of metadata implementation were identified. How these factors affect webpage visibility in a search engine results list was examined in an experimental study. Findings suggest that metadata is a good mechanism to improve webpage visibility, the metadata subject field plays a more important role than any other metadata field and keywords extracted from the webpage itself, particularly title or full-text, are most effective. To maximize the effects, these keywords should come from both title and full-text.
Source: Information processing and management. 41(2005) no.3, S.691-716
Type: a

Zhang, J.; Dimitroff, A.: ¬The impact of metadata implementation on webpage visibility in search engine results : part II (2005) 0.01

0.005573104 = product of:
  0.011146208 = sum of:
    0.011146208 = product of:
      0.016719311 = sum of:
        0.012513972 = weight(_text_:a in 1033) [ClassicSimilarity], result of:
          0.012513972 = score(doc=1033,freq=14.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.23593865 = fieldWeight in 1033, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1033)
        0.0042053396 = weight(_text_:s in 1033) [ClassicSimilarity], result of:
          0.0042053396 = score(doc=1033,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.08408674 = fieldWeight in 1033, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1033)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Abstract: This paper discusses the impact of metadata implementation in a webpage on its visibility performance in a search engine results list. Influential internal and external factors of metadata implementation were identified. How these factors affect webpage visibility in a search engine results list was examined in an experimental study. Findings suggest that metadata is a good mechanism to improve webpage visibility, the metadata subject field plays a more important role than any other metadata field and keywords extracted from the webpage itself, particularly title or full-text, are most effective. To maximize the effects, these keywords should come from both title and full-text.
Source: Information processing and management. 41(2005) no.3, S.691-715
Type: a

Zhang, J.; Nguyen, T.: WebStar: a visualization model for hyperlink structures (2005) 0.01

0.005255671 = product of:
  0.010511342 = sum of:
    0.010511342 = product of:
      0.015767014 = sum of:
        0.012162438 = weight(_text_:a in 1056) [ClassicSimilarity], result of:
          0.012162438 = score(doc=1056,freq=18.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.22931081 = fieldWeight in 1056, product of:
              4.2426405 = tf(freq=18.0), with freq of:
                18.0 = termFreq=18.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=1056)
        0.0036045765 = weight(_text_:s in 1056) [ClassicSimilarity], result of:
          0.0036045765 = score(doc=1056,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.072074346 = fieldWeight in 1056, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.046875 = fieldNorm(doc=1056)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Abstract: The authors introduce an information visualization model, WebStar, for hyperlink-based information systems. Hyperlinks within a hyperlink-based document can be visualized in a two-dimensional visual space. All links are projected within a display sphere in the visual space. The relationship between a specified central document and its hyperlinked documents is visually presented in the visual space. In addition, users are able to define a group of subjects and to observe relevance between each subject and all hyperlinked documents via movement of that subject around the display sphere center. WebStar allows users to dynamically change an interest center during navigation. A retrieval mechanism is developed to control retrieved results in the visual space. Impact of movement of a subject on the visual document distribution is analyzed. An ambiguity problem caused by projection is discussed. Potential applications of this visualization model in information retrieval are included. Future research directions on the topic are addressed.
Source: Information processing and management. 41(2005) no.4, S.1003-1018
Type: a

Wolfram, D.; Zhang, J.: ¬An investigation of the influence of indexing exhaustivity and term distributions on a document space (2002) 0.00
```
0.004736294 = product of:
  0.009472588 = sum of:
    0.009472588 = product of:
      0.014208882 = sum of:
        0.011205068 = weight(_text_:a in 5238) [ClassicSimilarity], result of:
          0.011205068 = score(doc=5238,freq=22.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.21126054 = fieldWeight in 5238, product of:
              4.690416 = tf(freq=22.0), with freq of:
                22.0 = termFreq=22.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5238)
        0.003003814 = weight(_text_:s in 5238) [ClassicSimilarity], result of:
          0.003003814 = score(doc=5238,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.060061958 = fieldWeight in 5238, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5238)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
```
Abstract

Wolfram and Zhang are interested in the effect of different indexing exhaustivity, by which they mean the number of terms chosen, and of different index term distributions and different term weighting methods on the resulting document cluster organization. The Distance Angle Retrieval Environment, DARE, which provides a two dimensional display of retrieved documents was used to represent the document clusters based upon a document's distance from the searcher's main interest, and on the angle formed by the document, a point representing a minor interest, and the point representing the main interest. If the centroid and the origin of the document space are assigned as major and minor points the average distance between documents and the centroid can be measured providing an indication of cluster organization. in the form of a size normalized similarity measure. Using 500 records from NTIS and nine models created by intersecting low, observed, and high exhaustivity levels (based upon a negative binomial distribution) with shallow, observed, and steep term distributions (based upon a Zipf distribution) simulation runs were preformed using inverse document frequency, inter-document term frequency, and inverse document frequency based upon both inter and intra-document frequencies. Low exhaustivity and shallow distributions result in a more dense document space and less effective retrieval. High exhaustivity and steeper distributions result in a more diffuse space.

Source

Journal of the American Society for Information Science and Technology. 53(2002) no.11, S.944-952

Type

a

Zhang, J.; Korfhage, R.R.: ¬A distance and angle similarity measure method (1999) 0.00

0.0047229175 = product of:
  0.009445835 = sum of:
    0.009445835 = product of:
      0.014168751 = sum of:
        0.009362649 = weight(_text_:a in 3915) [ClassicSimilarity], result of:
          0.009362649 = score(doc=3915,freq=6.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.17652355 = fieldWeight in 3915, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=3915)
        0.0048061023 = weight(_text_:s in 3915) [ClassicSimilarity], result of:
          0.0048061023 = score(doc=3915,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.09609913 = fieldWeight in 3915, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.0625 = fieldNorm(doc=3915)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Abstract: This article presents a distance and angle similarity measure. The integrated similarity measure takes the strenghts of both the distance and direction of measured documents into account. This article analyzes the features of the similarity measure by comparing it with the traditional distance-based similarity measure and the cosine measure, providing the iso-similarity contour, investigating the impacts of the parameters and variables on the new similarity measure. It also gives the further research issues on the topic
Source: Journal of the American Society for Information Science. 50(1999) no.9, S.772-778
Type: a

Zhang, J.; Wolfram, D.: Visualization of term discrimination analysis (2001) 0.00
```
0.0045624757 = product of:
  0.009124951 = sum of:
    0.009124951 = product of:
      0.013687426 = sum of:
        0.010683612 = weight(_text_:a in 5210) [ClassicSimilarity], result of:
          0.010683612 = score(doc=5210,freq=20.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.20142901 = fieldWeight in 5210, product of:
              4.472136 = tf(freq=20.0), with freq of:
                20.0 = termFreq=20.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5210)
        0.003003814 = weight(_text_:s in 5210) [ClassicSimilarity], result of:
          0.003003814 = score(doc=5210,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.060061958 = fieldWeight in 5210, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5210)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
```
Abstract

Zang and Wolfram compute the discrimination value for terms as the difference between the centroid value of all terms in the corpus and that value without the term in question, and suggest selection be made by comparing density changes with a visualization tool. The Distance Angle Retrieval Environment (DARE) visually projects a document or term space by presenting distance similarity on the X axis and angular similarity on the Y axis. Thus a document icon appearing close to the X axis would be relevant to reference points in terms of a distance similarity measure, while those close to the Y axis are relevant to reference points in terms of an angle based measure. Using 450 Associated Press news reports indexed by 44 distinct terms, the removal of the term ``Yeltsin'' causes the cluster to fall on the Y axis indicating a good discriminator. For an angular measure, cosine say, movement along the X axis to the left will signal good discrimination, as movement to the right will signal poor discrimination. A term density space could also be used. Most terms are shown to be indifferent discriminators. Different measures result in different choices as good and poor discriminators, as does the use of a term space rather than a document space. The visualization approach is clearly feasible, and provides some additional insights not found in the computation of a discrimination value.

Source

Journal of the American Society for Information Science and technology. 52(2001) no.8, S.615-627

Type

a
Zhang, J.; An, L.; Tang, T.; Hong, Y.: Visual health subject directory analysis based on users' traversal activities (2009) 0.00
```
0.0045117214 = product of:
  0.009023443 = sum of:
    0.009023443 = product of:
      0.013535164 = sum of:
        0.009930588 = weight(_text_:a in 3112) [ClassicSimilarity], result of:
          0.009930588 = score(doc=3112,freq=12.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.18723148 = fieldWeight in 3112, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=3112)
        0.0036045765 = weight(_text_:s in 3112) [ClassicSimilarity], result of:
          0.0036045765 = score(doc=3112,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.072074346 = fieldWeight in 3112, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.046875 = fieldNorm(doc=3112)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
```
Abstract

Concerns about health issues cover a wide spectrum. Consumer health information, which has become more available on the Internet, plays an extremely important role in addressing these concerns. A subject directory as an information organization and browsing mechanism is widely used in consumer health-related Websites. In this study we employed the information visualization technique Self-Organizing Map (SOM) in combination with a new U-matrix algorithm to analyze health subject clusters through a Web transaction log. An experimental study was conducted to test the proposed methods. The findings show that the clusters identified from the same cells based on path-length-1 outperformed both the clusters from the adjacent cells based on path-length-1 and the clusters from the same cells based on path-length-2 in the visual SOM display. The U-matrix method successfully distinguished the irrelevant subjects situated in the adjacent cells with different colors in the SOM display. The findings of this study lead to a better understanding of the health-related subject relationship from the users' traversal perspective.

Source

Journal of the American Society for Information Science and Technology. 60(2009) no.10, S.1977-1994

Type

a
Wolfram, D.; Zhang, J.: ¬The influence of indexing practices and weighting algorithms on document spaces (2008) 0.00
```
0.0042233076 = product of:
  0.008446615 = sum of:
    0.008446615 = product of:
      0.012669923 = sum of:
        0.009065346 = weight(_text_:a in 1963) [ClassicSimilarity], result of:
          0.009065346 = score(doc=1963,freq=10.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.1709182 = fieldWeight in 1963, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=1963)
        0.0036045765 = weight(_text_:s in 1963) [ClassicSimilarity], result of:
          0.0036045765 = score(doc=1963,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.072074346 = fieldWeight in 1963, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.046875 = fieldNorm(doc=1963)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
```
Abstract

Index modeling and computer simulation techniques are used to examine the influence of indexing frequency distributions, indexing exhaustivity distributions, and three weighting methods on hypothetical document spaces in a vector-based information retrieval (IR) system. The way documents are indexed plays an important role in retrieval. The authors demonstrate the influence of different indexing characteristics on document space density (DSD) changes and document space discriminative capacity for IR. Document environments that contain a relatively higher percentage of infrequently occurring terms provide lower density outcomes than do environments where a higher percentage of frequently occurring terms exists. Different indexing exhaustivity levels, however, have little influence on the document space densities. A weighting algorithm that favors higher weights for infrequently occurring terms results in the lowest overall document space densities, which allows documents to be more readily differentiated from one another. This in turn can positively influence IR. The authors also discuss the influence on outcomes using two methods of normalization of term weights (i.e., means and ranges) for the different weighting methods.

Source

Journal of the American Society for Information Science and Technology. 59(2008) no.1, S.3-11

Type

a
Zhang, J.; Zhai, S.; Liu, H.; Stevenson, J.A.: Social network analysis on a topic-based navigation guidance system in a public health portal (2016) 0.00
```
0.004174508 = product of:
  0.008349016 = sum of:
    0.008349016 = product of:
      0.012523524 = sum of:
        0.00827549 = weight(_text_:a in 2887) [ClassicSimilarity], result of:
          0.00827549 = score(doc=2887,freq=12.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.15602624 = fieldWeight in 2887, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2887)
        0.004248034 = weight(_text_:s in 2887) [ClassicSimilarity], result of:
          0.004248034 = score(doc=2887,freq=4.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.08494043 = fieldWeight in 2887, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2887)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
```
Abstract

We investigated a topic-based navigation guidance system in the World Health Organization portal, compared the link connection network and the semantic connection network derived from the guidance system, analyzed the characteristics of the 2 networks from the perspective of the node centrality (in_closeness, out_closeness, betweenness, in_degree, and out_degree), and provided the suggestions to optimize and enhance the topic-based navigation guidance system. A mixed research method that combines the social network analysis method, clustering analysis method, and inferential analysis methods was used. The clustering analysis results of the link connection network were quite different from those of the semantic connection network. There were significant differences between the link connection network and the semantic network in terms of density and centrality. Inferential analysis results show that there were no strong correlations between the centrality of a node and its topic information characteristics. Suggestions for enhancing the navigation guidance system are discussed in detail. Future research directions, such as application of the same research method presented in this study to other similar public health portals, are also included.

Source

Journal of the Association for Information Science and Technology. 67(2016) no.5, S.1068-1088

Type

a

Geng, Q.; Townley, C.; Huang, K.; Zhang, J.: Comparative knowledge management : a pilot study of Chinese and American universities (2005) 0.00

0.0041325525 = product of:
  0.008265105 = sum of:
    0.008265105 = product of:
      0.012397657 = sum of:
        0.008192318 = weight(_text_:a in 3876) [ClassicSimilarity], result of:
          0.008192318 = score(doc=3876,freq=6.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.1544581 = fieldWeight in 3876, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3876)
        0.0042053396 = weight(_text_:s in 3876) [ClassicSimilarity], result of:
          0.0042053396 = score(doc=3876,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.08408674 = fieldWeight in 3876, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3876)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Abstract: Comparative study of knowledge management (KM) promises to lead to more effective knowledge use in all cultural environments. This pilot study compares KM priorities, needs, tools, and administrative structure components in large Chinese and American universities. General KM theory and literature related to KM in higher education are analyzed to develop the four components of the study. Comparative differences in KM practice at large Chinese and American universities are analyzed for each component. A correlation matrix reveals statistically significant co-variation among all but one of the study components. Four conclusions related to comparative KM and suggestions for future research are presented.
Source: Journal of the American Society for Information Science and Technology. 56(2005) no.10, S.1031-1044
Type: a

Zhang, J.; Jastram, I.: ¬A study of the metadata creation behavior of different user groups on the Internet (2006) 0.00

0.0041325525 = product of:
  0.008265105 = sum of:
    0.008265105 = product of:
      0.012397657 = sum of:
        0.008192318 = weight(_text_:a in 982) [ClassicSimilarity], result of:
          0.008192318 = score(doc=982,freq=6.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.1544581 = fieldWeight in 982, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=982)
        0.0042053396 = weight(_text_:s in 982) [ClassicSimilarity], result of:
          0.0042053396 = score(doc=982,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.08408674 = fieldWeight in 982, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.0546875 = fieldNorm(doc=982)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Abstract: Metadata is designed to improve information organization and information retrieval effectiveness and efficiency on the Internet. The way web publishers respond to metadata and the way they use it when publishing their web pages, however, is still a mystery. The authors of this paper aim to solve this mystery by defining different professional publisher groups, examining the behaviors of these user groups, and identifying the characteristics of their metadata use. This study will enhance the current understanding of metadata application behavior and provide evidence useful to researchers, web publishers, and search engine designers.
Source: Information processing and management. 42(2006) no.4, S.1099-1122
Type: a

Zhang, J.; Zhai, S.; Stevenson, J.A.; Xia, L.: Optimization of the subject directory in a government agriculture department web portal (2016) 0.00
```
0.003934163 = product of:
  0.007868326 = sum of:
    0.007868326 = product of:
      0.011802489 = sum of:
        0.0075544547 = weight(_text_:a in 3088) [ClassicSimilarity], result of:
          0.0075544547 = score(doc=3088,freq=10.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.14243183 = fieldWeight in 3088, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3088)
        0.004248034 = weight(_text_:s in 3088) [ClassicSimilarity], result of:
          0.004248034 = score(doc=3088,freq=4.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.08494043 = fieldWeight in 3088, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3088)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
```
Abstract

We investigated a subject directory in the US Agriculture Department-Economic Research Service portal. Parent-child relationships, related connections among the categories, and related connections among the subcategories in the subject directory were optimized using social network analysis. The optimization results were assessed by both density analysis and edge strength analysis methods. In addition, the results were evaluated by domain experts. From this study, it is recommended that four subcategories be switched from their original four categories into two different categories as a result of the parent-child relationship optimization.?It is also recommended that 132 subcategories be moved to 40 subcategories and that eight categories be moved to two categories as a result of the related connection optimization. The findings show that optimization boosted the densities of the optimized categories, and the recommended connections of both the related categories and subcategories were stronger than the existing connections of the related categories and subcategories. This paper provides visual displays of the optimization analysis as well as suggestions to enhance the subject directory of this portal.

Source

Journal of the Association for Information Science and Technology. 67(2016) no.9, S.2166-2180

Type

a
Zhang, J.; Yu, Q.; Zheng, F.; Long, C.; Lu, Z.; Duan, Z.: Comparing keywords plus of WOS and author keywords : a case study of patient adherence research (2016) 0.00
```
0.0039042893 = product of:
  0.0078085787 = sum of:
    0.0078085787 = product of:
      0.011712868 = sum of:
        0.008108292 = weight(_text_:a in 2857) [ClassicSimilarity], result of:
          0.008108292 = score(doc=2857,freq=8.0), product of:
            0.053039093 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045999072 = queryNorm
            0.15287387 = fieldWeight in 2857, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=2857)
        0.0036045765 = weight(_text_:s in 2857) [ClassicSimilarity], result of:
          0.0036045765 = score(doc=2857,freq=2.0), product of:
            0.05001192 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.045999072 = queryNorm
            0.072074346 = fieldWeight in 2857, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.046875 = fieldNorm(doc=2857)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
```
Abstract

Bibliometric analysis based on literature in the Web of Science (WOS) has become an increasingly popular method for visualizing the structure of scientific fields. Keywords Plus and Author Keywords are commonly selected as units of analysis, despite the limited research evidence demonstrating the effectiveness of Keywords Plus. This study was conceived to evaluate the efficacy of Keywords Plus as a parameter for capturing the content and scientific concepts presented in articles. Using scientific papers about patient adherence that were retrieved from WOS, a comparative assessment of Keywords Plus and Author Keywords was performed at the scientific field level and the document level, respectively. Our search yielded more Keywords Plus terms than Author Keywords, and the Keywords Plus terms were more broadly descriptive. Keywords Plus is as effective as Author Keywords in terms of bibliometric analysis investigating the knowledge structure of scientific fields, but it is less comprehensive in representing an article's content.

Source

Journal of the Association for Information Science and Technology. 67(2016) no.4, S.967-972

Type

a

Search (36 results, page 1 of 2)

Authors

Years

Types

Themes