Search (23 results, page 1 of 2)

  • × author_ss:"White, H.D."
  1. White, H.D.; Lin, X.; McCain, K.W.: Two modes of automated domain analysis : multidimensional scaling vs. Kohonen feature mapping of information science authors (1998) 0.03
    0.029315805 = product of:
      0.10260531 = sum of:
        0.044992477 = weight(_text_:wide in 143) [ClassicSimilarity], result of:
          0.044992477 = score(doc=143,freq=2.0), product of:
            0.1312982 = queryWeight, product of:
              4.4307585 = idf(docFreq=1430, maxDocs=44218)
              0.029633347 = queryNorm
            0.342674 = fieldWeight in 143, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4307585 = idf(docFreq=1430, maxDocs=44218)
              0.0546875 = fieldNorm(doc=143)
        0.024409214 = weight(_text_:web in 143) [ClassicSimilarity], result of:
          0.024409214 = score(doc=143,freq=2.0), product of:
            0.09670874 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.029633347 = queryNorm
            0.25239927 = fieldWeight in 143, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.0546875 = fieldNorm(doc=143)
        0.012233062 = weight(_text_:information in 143) [ClassicSimilarity], result of:
          0.012233062 = score(doc=143,freq=6.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.23515764 = fieldWeight in 143, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0546875 = fieldNorm(doc=143)
        0.020970564 = weight(_text_:retrieval in 143) [ClassicSimilarity], result of:
          0.020970564 = score(doc=143,freq=2.0), product of:
            0.08963835 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.029633347 = queryNorm
            0.23394634 = fieldWeight in 143, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0546875 = fieldNorm(doc=143)
      0.2857143 = coord(4/14)
    
    Abstract
    This paper shows that, given co-citation data, Kohonen feature mapping produces results quite similar to those of multidimensional scaling, the traditional mode for computer-assisted mapping of intellectual domains. It further presents a Kohonen feature map based on author co-citation data that links author names to information about them on the World Wide Web. The results bear on a goal for present-day information science: the integration of computerized bibliometrics with document retrieval
  2. White, H.D.: Pathfinder networks and author cocitation analysis : a remapping of paradigmatic information scientists (2003) 0.01
    0.009107954 = product of:
      0.042503785 = sum of:
        0.017435152 = weight(_text_:web in 1459) [ClassicSimilarity], result of:
          0.017435152 = score(doc=1459,freq=2.0), product of:
            0.09670874 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.029633347 = queryNorm
            0.18028519 = fieldWeight in 1459, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1459)
        0.010089659 = weight(_text_:information in 1459) [ClassicSimilarity], result of:
          0.010089659 = score(doc=1459,freq=8.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.19395474 = fieldWeight in 1459, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1459)
        0.014978974 = weight(_text_:retrieval in 1459) [ClassicSimilarity], result of:
          0.014978974 = score(doc=1459,freq=2.0), product of:
            0.08963835 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.029633347 = queryNorm
            0.16710453 = fieldWeight in 1459, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1459)
      0.21428572 = coord(3/14)
    
    Abstract
    In their 1998 article "Visualizing a discipline: An author cocitation analysis of information science, 1972-1995," White and McCain used multidimensional scaling, hierarchical clustering, and factor analysis to display the specialty groupings of 120 highly-cited ("paradigmatic") information scientists. These statistical techniques are traditional in author cocitation analysis (ACA). It is shown here that a newer technique, Pathfinder Networks (PFNETs), has considerable advantages for ACA. In PFNETs, nodes represent authors, and explicit links represent weighted paths between nodes, the weights in this case being cocitation counts. The links can be drawn to exclude all but the single highest counts for author pairs, which reduces a network of authors to only the most salient relationships. When these are mapped, dominant authors can be defined as those with relatively many links to other authors (i.e., high degree centrality). Links between authors and dominant authors define specialties, and links between dominant authors connect specialties into a discipline. Maps are made with one rather than several computer routines and in one rather than many computer passes. Also, PFNETs can, and should, be generated from matrices of raw counts rather than Pearson correlations, which removes a computational step associated with traditional ACA. White and McCain's raw data from 1998 are remapped as a PFNET. It is shown that the specialty groupings correspond closely to those seen in the factor analysis of the 1998 article. Because PFNETs are fast to compute, they are used in AuthorLink, a new Web-based system that creates live interfaces for cocited author retrieval an the fly.
    Source
    Journal of the American Society for Information Science and technology. 54(2003) no.5, S.423-434
  3. White, H.D.: Literature retrieval for interdisciplinary syntheses (1996) 0.01
    0.007083241 = product of:
      0.049582683 = sum of:
        0.008071727 = weight(_text_:information in 7262) [ClassicSimilarity], result of:
          0.008071727 = score(doc=7262,freq=2.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.1551638 = fieldWeight in 7262, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0625 = fieldNorm(doc=7262)
        0.041510954 = weight(_text_:retrieval in 7262) [ClassicSimilarity], result of:
          0.041510954 = score(doc=7262,freq=6.0), product of:
            0.08963835 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.029633347 = queryNorm
            0.46309367 = fieldWeight in 7262, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0625 = fieldNorm(doc=7262)
      0.14285715 = coord(2/14)
    
    Abstract
    Considers practical ways of performing interdisciplinary searches, particularly onlines searches, for subjects with the aim of retrieving literature outside the main discipline of the search topic. Discusses the use of bibliographic markers of various types and demonstrates DIALOG's RANK command as a means of revealing interdisciplinarity in any field. Considers retrieval techniques for searchers interested in synthesizing work from their own discipline (in the example, library and information science) with work from another disciplines. Discusses creativity, the connection of hitherto unconnected literatures, the retrieval and assessment of syntheses, and the nature of library browsing
  4. White, H.D.: Combining bibliometrics, information retrieval, and relevance theory : part 1: first examples of a synthesis (2007) 0.01
    0.006823277 = product of:
      0.047762938 = sum of:
        0.014268933 = weight(_text_:information in 436) [ClassicSimilarity], result of:
          0.014268933 = score(doc=436,freq=16.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.27429342 = fieldWeight in 436, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=436)
        0.033494003 = weight(_text_:retrieval in 436) [ClassicSimilarity], result of:
          0.033494003 = score(doc=436,freq=10.0), product of:
            0.08963835 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.029633347 = queryNorm
            0.37365708 = fieldWeight in 436, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0390625 = fieldNorm(doc=436)
      0.14285715 = coord(2/14)
    
    Abstract
    In Sperber and Wilson's relevance theory (RT), the ratio Cognitive Effects/Processing Effort defines the relevance of a communication. The tf*idf formula from information retrieval is used to operationalize this ratio for any item co-occurring with a user-supplied seed term in bibliometric distributions. The tf weight of the item predicts its effect on the user in the context of the seed term, and its idf weight predicts the user's processing effort in relating the item to the seed term. The idf measure, also known as statistical specificity, is shown to have unsuspected applications in quantifying interrelated concepts such as topical and nontopical relevance, levels of user expertise, and levels of authority. A new kind of visualization, the pennant diagram, illustrates these claims. The bibliometric distributions visualized are the works cocited with a seed work (Moby Dick), the authors cocited with a seed author (White HD, for maximum interpretability), and the books and articles cocited with a seed article (S.A. Harter's "Psychological Relevance and Information Science," which introduced RT to information scientists in 1992). Pennant diagrams use bibliometric data and information retrieval techniques on the system side to mimic a relevancetheoretic model of cognition on the user side. Relevance theory may thus influence the design of new visual information retrieval interfaces. Generally, when information retrieval and bibliometrics are interpreted in light of RT, the implications are rich: A single sociocognitive theory may serve to integrate research on literature-based systems with research on their users, areas now largely separate.
    Source
    Journal of the American Society for Information Science and Technology. 58(2007) no.4, S.536-559
  5. White, H.D.: Relevance theory and distributions of judgments in document retrieval (2017) 0.01
    0.0052989167 = product of:
      0.037092414 = sum of:
        0.0071344664 = weight(_text_:information in 5099) [ClassicSimilarity], result of:
          0.0071344664 = score(doc=5099,freq=4.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.13714671 = fieldWeight in 5099, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5099)
        0.029957948 = weight(_text_:retrieval in 5099) [ClassicSimilarity], result of:
          0.029957948 = score(doc=5099,freq=8.0), product of:
            0.08963835 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.029633347 = queryNorm
            0.33420905 = fieldWeight in 5099, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5099)
      0.14285715 = coord(2/14)
    
    Abstract
    This article extends relevance theory (RT) from linguistic pragmatics into information retrieval. Using more than 50 retrieval experiments from the literature as examples, it applies RT to explain the frequency distributions of documents on relevance scales with three or more points. The scale points, which judges in experiments must consider in addition to queries and documents, are communications from researchers. In RT, the relevance of a communication varies directly with its cognitive effects and inversely with the effort of processing it. Researchers define and/or label the scale points to measure the cognitive effects of documents on judges. However, they apparently assume that all scale points as presented are equally easy for judges to process. Yet the notion that points cost variable effort explains fairly well the frequency distributions of judgments across them. By hypothesis, points that cost more effort are chosen by judges less frequently. Effort varies with the vagueness or strictness of scale-point labels and definitions. It is shown that vague scales tend to produce U- or V-shaped distributions, while strict scales tend to produce right-skewed distributions. These results reinforce the paper's more general argument that RT clarifies the concept of relevance in the dialogues of retrieval evaluation.
    Source
    Information processing and management. 53(2017) no.5, S.1080-1102
  6. White, H.D.: Combining bibliometrics, information retrieval, and relevance theory : part 2: some implications for information science (2007) 0.00
    0.0046377215 = product of:
      0.03246405 = sum of:
        0.011280581 = weight(_text_:information in 437) [ClassicSimilarity], result of:
          0.011280581 = score(doc=437,freq=10.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.21684799 = fieldWeight in 437, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=437)
        0.021183468 = weight(_text_:retrieval in 437) [ClassicSimilarity], result of:
          0.021183468 = score(doc=437,freq=4.0), product of:
            0.08963835 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.029633347 = queryNorm
            0.23632148 = fieldWeight in 437, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0390625 = fieldNorm(doc=437)
      0.14285715 = coord(2/14)
    
    Abstract
    When bibliometric data are converted to term frequency (tf) and inverse document frequency (idf) values, plotted as pennant diagrams, and interpreted according to Sperber and Wilson's relevance theory (RT), the results evoke major variables of information science (IS). These include topicality, in the sense of intercohesion and intercoherence among texts; cognitive effects of texts in response to people's questions; people's levels of expertise as a precondition for cognitive effects; processing effort as textual or other messages are received; specificity of terms as it affects processing effort; relevance, defined in RT as the effects/effort ratio; and authority of texts and their authors. While such concerns figure automatically in dialogues between people, they become problematic when people create or use or judge literature-based information systems. The difficulty of achieving worthwhile cognitive effects and acceptable processing effort in human-system dialogues explains why relevance is the central concern of IS. Moreover, since relevant communication with both systems and unfamiliar people is uncertain, speakers tend to seek cognitive effects that cost them the least effort. Yet hearers need greater effort, often greater specificity, from speakers if their responses are to be highly relevant in their turn. This theme of mismatch manifests itself in vague reference questions, underdeveloped online searches, uncreative judging in retrieval evaluation trials, and perfunctory indexing. Another effect of least effort is a bias toward topical relevance over other kinds. RT can explain these outcomes as well as more adaptive ones. Pennant diagrams, applied here to a literature search and a Bradford-style journal analysis, can model them. Given RT and the right context, bibliometrics may predict psychometrics.
    Source
    Journal of the American Society for Information Science and Technology. 58(2007) no.4, S.583-605
  7. White, H.D.: Authors as citers over time (2001) 0.00
    0.004488223 = product of:
      0.03141756 = sum of:
        0.025709987 = weight(_text_:wide in 5581) [ClassicSimilarity], result of:
          0.025709987 = score(doc=5581,freq=2.0), product of:
            0.1312982 = queryWeight, product of:
              4.4307585 = idf(docFreq=1430, maxDocs=44218)
              0.029633347 = queryNorm
            0.1958137 = fieldWeight in 5581, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4307585 = idf(docFreq=1430, maxDocs=44218)
              0.03125 = fieldNorm(doc=5581)
        0.005707573 = weight(_text_:information in 5581) [ClassicSimilarity], result of:
          0.005707573 = score(doc=5581,freq=4.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.10971737 = fieldWeight in 5581, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.03125 = fieldNorm(doc=5581)
      0.14285715 = coord(2/14)
    
    Abstract
    This study explores the tendency of authors to recite themselves and others in multiple works over time, using the insights gained to build citation theory. The set of all authors whom an author cites is defined as that author's citation identity. The study explains how to retrieve citation identities from the Institute for Scientific Information's files on Dialog and how to deal with idiosyncrasies of these files. As the author's oeuvre grows, the identity takes the form of a core-and-scatter distribution that may be divided into authors cited only once (unicitations) and authors cited at least twice (recitations). The latter group, especially those recited most frequently, are interpretable as symbols of a citer's main substantive concerns. As illustrated by the top recitees of eight information scientists, identities are intelligible, individualized, and wide-ranging. They are ego-centered without being egotistical. They are often affected by social ties between citers and citees, but the universal motivator seems to be the perceived relevance of the citees' works. Citing styles in identities differ: "scientific-paper style" authors recite heavily, adding to core; "bibliographic-essay style" authors are heavy on unicitations, adding to scatter; "literature-review style" authors do both at once. Identities distill aspects of citers' intellectual lives, such as orienting figures, interdisciplinary interests, bidisciplinary careers, and conduct in controversies. They can also be related to past schemes for classifying citations in categories such as positive-negative and perfunctory- organic; indeed, one author's frequent recitation of another, whether positive or negative, may be the readiest indicator of an organic relation between them. The shape of the core-and-scatter distribution of names in identities can be explained by the principle of least effort. Citers economize on effort by frequently reciting only a relatively small core of names in their identities. They also economize by frequent use of perfunctory citations, which require relatively little context, and infrequent use of negative citations, which require contexts more laborious to set
    Source
    Journal of the American Society for Information Science and technology. 52(2001) no.2, S.87-108
  8. Lin, X.; White, H.D.; Buzydlowski, J.: Real-time author co-citation mapping for online searching (2003) 0.00
    0.00406575 = product of:
      0.028460251 = sum of:
        0.0104854815 = weight(_text_:information in 1080) [ClassicSimilarity], result of:
          0.0104854815 = score(doc=1080,freq=6.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.20156369 = fieldWeight in 1080, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=1080)
        0.01797477 = weight(_text_:retrieval in 1080) [ClassicSimilarity], result of:
          0.01797477 = score(doc=1080,freq=2.0), product of:
            0.08963835 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.029633347 = queryNorm
            0.20052543 = fieldWeight in 1080, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.046875 = fieldNorm(doc=1080)
      0.14285715 = coord(2/14)
    
    Abstract
    Author searching is traditionally based on the matching of name strings. Special characteristics of authors as personal names and subject indicators are not considered. This makes it difficult to identify a set of related authors or to group authors by subjects in retrieval systems. In this paper, we describe the design and implementation of a prototype visualization system to enhance author searching. The system, called AuthorLink, is based on author co-citation analysis and visualization mapping algorithms such as Kohonen's feature maps and Pathfinder networks. AuthorLink produces interactive author maps in real time from a database of 1.26 million records supplied by the Institute for Scientific Information. The maps show subject groupings and more fine-grained intellectual connections among authors. Through the interactive interface the user can take advantage of such information to refine queries and retrieve documents through point-and-click manipulation of the authors' names.
    Source
    Information processing and management. 39(2003) no.5, S.689-706
  9. White, H.D.; McCain, K.W.: Visualization of literatures (1997) 0.00
    0.004004761 = product of:
      0.028033325 = sum of:
        0.0070627616 = weight(_text_:information in 2291) [ClassicSimilarity], result of:
          0.0070627616 = score(doc=2291,freq=2.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.13576832 = fieldWeight in 2291, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2291)
        0.020970564 = weight(_text_:retrieval in 2291) [ClassicSimilarity], result of:
          0.020970564 = score(doc=2291,freq=2.0), product of:
            0.08963835 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.029633347 = queryNorm
            0.23394634 = fieldWeight in 2291, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2291)
      0.14285715 = coord(2/14)
    
    Abstract
    State of the art review of recent models of literatures that offer visual clues to relationships among writings that are often based term occurences and co-occurences. Considers the advantages of 2 dimensional and 3 dimensional displays of relationships over other models; bibliographic models; editorial models; bibliometric models; user models; and synthetic models. Discusses the online visualization and offline visualizations and the problems of visualizing changing literatures in a static medium, such as hard copy print. Argues that insufficient attention has been paid to user friendly visual design with the related questions of new capabilities and scaling up to larger collections. Concludes with the hope that, in future, the same visualization interface used for bibliographic domain analysis will be used for document retrieval
    Source
    Annual review of information science and technology. 32(1997), S.99-168
  10. White, H.D.: Citation analysis : history (2009) 0.00
    0.0037468998 = product of:
      0.026228298 = sum of:
        0.0050448296 = weight(_text_:information in 3763) [ClassicSimilarity], result of:
          0.0050448296 = score(doc=3763,freq=2.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.09697737 = fieldWeight in 3763, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3763)
        0.021183468 = weight(_text_:retrieval in 3763) [ClassicSimilarity], result of:
          0.021183468 = score(doc=3763,freq=4.0), product of:
            0.08963835 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.029633347 = queryNorm
            0.23632148 = fieldWeight in 3763, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3763)
      0.14285715 = coord(2/14)
    
    Abstract
    References from publications are at the same time citations to other publications. This entry introduces some of the practical uses of citation data in science and scholarship. At the individual level citations identify and permit the retrieval of specific editions of works, while also suggesting their subject matter, authority, and age. Through citation indexes, retrievals may include not only the earlier items referred to by a given work, but also the later items that cite that given work in turn. Some technical notes on retrieval are included here. Counts of citations received over time, and measures derived from them, reveal the varying impacts of works, authors, journals, organizations, and countries. This has obvious implications for the evaluation of, e.g., library collections, academics, research teams, and science policies. When treated as linkages between pairs of publications, references and citations reveal intellectual ties. Several kinds of links have been defined, such as cocitation, bibliographic coupling, and intercitation. In the aggregate, these links form networks that compactly suggest the intellectual histories of research specialties and disciplines, especially when the networks are visualized through mapping software. Citation analysis is of course not without critics, who have long pointed out imperfections in the data or in analytical techniques. However, the criticisms have generally been met by strong counterarguments from proponents.
    Source
    Encyclopedia of library and information sciences. 3rd ed. Ed.: M.J. Bates
  11. White, H.D.: Relevance in theory (2009) 0.00
    0.0030010925 = product of:
      0.021007646 = sum of:
        0.009024465 = weight(_text_:information in 3872) [ClassicSimilarity], result of:
          0.009024465 = score(doc=3872,freq=10.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.1734784 = fieldWeight in 3872, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.03125 = fieldNorm(doc=3872)
        0.0119831795 = weight(_text_:retrieval in 3872) [ClassicSimilarity], result of:
          0.0119831795 = score(doc=3872,freq=2.0), product of:
            0.08963835 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.029633347 = queryNorm
            0.13368362 = fieldWeight in 3872, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.03125 = fieldNorm(doc=3872)
      0.14285715 = coord(2/14)
    
    Abstract
    Relevance is the central concept in information science because of its salience in designing and evaluating literature-based answering systems. It is salient when users seek information through human intermediaries, such as reference librarians, but becomes even more so when systems are automated and users must navigate them on their own. Designers of classic precomputer systems of the nineteenth and twentieth centuries appear to have been no less concerned with relevance than the information scientists of today. The concept has, however, proved difficult to define and operationalize. A common belief is that it is a relation between a user's request for information and the documents the system retrieves in response. Documents might be considered retrieval-worthy because they: 1) constitute evidence for or against a claim; 2) answer a question; or 3) simply match the request in topic. In practice, literature-based answering makes use of term-matching technology, and most evaluation of relevance has involved topical match as the primary criterion for acceptability. The standard table for evaluating the relation of retrieved documents to a request has only the values "relevant" and "not relevant," yet many analysts hold that relevance admits of degrees. Moreover, many analysts hold that users decide relevance on more dimensions than topical match. Who then can validly judge relevance? Is it only the person who put the request and who can evaluate a document on multiple dimensions? Or can surrogate judges perform this function on the basis of topicality? Such questions arise in a longstanding debate on whether relevance is objective or subjective. One proposal has been to reframe the debate in terms of relevance theory (imported from linguistic pragmatics), which makes relevance increase with a document's valuable cognitive effects and decrease with the effort needed to process it. This notion allows degree of topical match to contribute to relevance but allows other considerations to contribute as well. Since both cognitive effects and processing effort will differ across users, they can be taken as subjective, but users' decisions can also be objectively evaluated if the logic behind them is made explicit. Relevance seems problematical because the considerations that lead people to accept documents in literature searches, or to use them later in contexts such as citation, are seldom fully revealed. Once they are revealed, relevance may be seen as not only multidimensional and dynamic, but also understandable.
    Source
    Encyclopedia of library and information sciences. 3rd ed. Ed.: M.J. Bates
  12. MacCain, K.W.; White, H.D.; Griffith, B.C.: Comparing retrieval performance in online data bases (1987) 0.00
    0.0028605436 = product of:
      0.020023804 = sum of:
        0.0050448296 = weight(_text_:information in 1167) [ClassicSimilarity], result of:
          0.0050448296 = score(doc=1167,freq=2.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.09697737 = fieldWeight in 1167, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1167)
        0.014978974 = weight(_text_:retrieval in 1167) [ClassicSimilarity], result of:
          0.014978974 = score(doc=1167,freq=2.0), product of:
            0.08963835 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.029633347 = queryNorm
            0.16710453 = fieldWeight in 1167, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1167)
      0.14285715 = coord(2/14)
    
    Source
    Information processing and management. 23(1987), S.539-553
  13. Buzydlowski, J.W.; White, H.D.; Lin, X.: Term Co-occurrence Analysis as an Interface for Digital Libraries (2002) 0.00
    0.001986865 = product of:
      0.02781611 = sum of:
        0.02781611 = product of:
          0.08344833 = sum of:
            0.08344833 = weight(_text_:22 in 1339) [ClassicSimilarity], result of:
              0.08344833 = score(doc=1339,freq=6.0), product of:
                0.103770934 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.029633347 = queryNorm
                0.804159 = fieldWeight in 1339, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1339)
          0.33333334 = coord(1/3)
      0.071428575 = coord(1/14)
    
    Date
    22. 2.2003 17:25:39
    22. 2.2003 18:16:22
  14. White, H.D.: Bibliometric overview of information science (2009) 0.00
    0.0012892094 = product of:
      0.01804893 = sum of:
        0.01804893 = weight(_text_:information in 3753) [ClassicSimilarity], result of:
          0.01804893 = score(doc=3753,freq=10.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.3469568 = fieldWeight in 3753, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0625 = fieldNorm(doc=3753)
      0.071428575 = coord(1/14)
    
    Abstract
    This entry presents an account of the core concerns of information science through such means as definitional sketches, identification of themes, historical notes, and bibliometric evidence, including a citation-based map of 121 prominent information scientists of the twentieth century. The attempt throughout is to give concrete and pithy descriptions, to provide numerous specific examples, and to take a critical view of certain received language and ideas in library and information science.
    Source
    Encyclopedia of library and information sciences. 3rd ed. Ed.: M.J. Bates
  15. White, H.D.: Scientist-poets wanted (1999) 0.00
    0.0012482718 = product of:
      0.017475804 = sum of:
        0.017475804 = weight(_text_:information in 4326) [ClassicSimilarity], result of:
          0.017475804 = score(doc=4326,freq=6.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.3359395 = fieldWeight in 4326, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.078125 = fieldNorm(doc=4326)
      0.071428575 = coord(1/14)
    
    Content
    Beitrag eines Themenheftes: The 50th Anniversary of the Journal of the American Society for Information Science. Pt.2: Paradigms, models, and models of information science
    Source
    Journal of the American Society for Information Science. 50(1999) no.12, S.1051-1063
  16. White, H.D.; Griffith, B.C.: Quality of indexing in online data bases (1987) 0.00
    0.0011531039 = product of:
      0.016143454 = sum of:
        0.016143454 = weight(_text_:information in 772) [ClassicSimilarity], result of:
          0.016143454 = score(doc=772,freq=2.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.3103276 = fieldWeight in 772, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.125 = fieldNorm(doc=772)
      0.071428575 = coord(1/14)
    
    Source
    Information processing and management. 23(1987), S.211-224
  17. White, H.D.; McCain, W.: Bibliometrics (1989) 0.00
    0.0011531039 = product of:
      0.016143454 = sum of:
        0.016143454 = weight(_text_:information in 341) [ClassicSimilarity], result of:
          0.016143454 = score(doc=341,freq=2.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.3103276 = fieldWeight in 341, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.125 = fieldNorm(doc=341)
      0.071428575 = coord(1/14)
    
    Source
    Annual review of information science and technology. 24(1989), S.119-186
  18. White, H.D.; McCain, K.W.: Visualizing a discipline : an author co-citation analysis of information science, 1972-1995 (1998) 0.00
    9.986174E-4 = product of:
      0.013980643 = sum of:
        0.013980643 = weight(_text_:information in 5020) [ClassicSimilarity], result of:
          0.013980643 = score(doc=5020,freq=6.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.2687516 = fieldWeight in 5020, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0625 = fieldNorm(doc=5020)
      0.071428575 = coord(1/14)
    
    Abstract
    Presents an extensive domain analysis of information science in terms of its authors. Names of those most frequently cited in 12 key journals from 1972 through 1995 were retrieved from Social SciSearch via Dialog. The top 120 were submitted to author co-citation analyzes, yielding automatic classifications relevant to histories of the field
    Source
    Journal of the American Society for Information Science. 49(1998) no.4, S.327-355
  19. White, H.D.; Bates, M.J.; Wilson, P.: For information specialists : interpretations of reference and bibliographic work (1992) 0.00
    8.64828E-4 = product of:
      0.012107591 = sum of:
        0.012107591 = weight(_text_:information in 7742) [ClassicSimilarity], result of:
          0.012107591 = score(doc=7742,freq=2.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.23274569 = fieldWeight in 7742, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.09375 = fieldNorm(doc=7742)
      0.071428575 = coord(1/14)
    
  20. White, H.D.: Author cocitation analysis and pearson's r (2003) 0.00
    5.0960475E-4 = product of:
      0.0071344664 = sum of:
        0.0071344664 = weight(_text_:information in 2119) [ClassicSimilarity], result of:
          0.0071344664 = score(doc=2119,freq=4.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.13714671 = fieldWeight in 2119, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2119)
      0.071428575 = coord(1/14)
    
    Abstract
    In their article "Requirements for a cocitation similarity measure, with special reference to Pearson's correlation coefficient," Ahlgren, Jarneving, and Rousseau fault traditional author cocitation analysis (ACA) for using Pearson's r as a measure of similarity between authors because it fails two tests of stability of measurement. The instabilities arise when rs are recalculated after a first coherent group of authors has been augmented by a second coherent group with whom the first has little or no cocitation. However, AJ&R neither cluster nor map their data to demonstrate how fluctuations in rs will mislead the analyst, and the problem they pose is remote from both theory and practice in traditional ACA. By entering their own rs into multidimensional scaling and clustering routines, I show that, despite r's fluctuations, clusters based an it are much the same for the combined groups as for the separate groups. The combined groups when mapped appear as polarized clumps of points in two-dimensional space, confirming that differences between the groups have become much more important than differences within the groups-an accurate portrayal of what has happened to the data. Moreover, r produces clusters and maps very like those based an other coefficients that AJ&R mention as possible replacements, such as a cosine similarity measure or a chi square dissimilarity measure. Thus, r performs well enough for the purposes of ACA. Accordingly, I argue that qualitative information revealing why authors are cocited is more important than the cautions proposed in the AJ&R critique. I include notes an topics such as handling the diagonal in author cocitation matrices, lognormalizing data, and testing r for significance.
    Source
    Journal of the American Society for Information Science and technology. 54(2003) no.13, S.1250-1259