Search (41 results, page 1 of 3)

  • × theme_ss:"Citation indexing"
  • × year_i:[2000 TO 2010}
  1. Lai, K.-K.; Wu, S.-J.: Using the patent co-citation approach to establish a new patent classification system (2005) 0.02
    0.02066828 = product of:
      0.09645197 = sum of:
        0.021217827 = weight(_text_:subject in 1013) [ClassicSimilarity], result of:
          0.021217827 = score(doc=1013,freq=2.0), product of:
            0.10738805 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.03002521 = queryNorm
            0.19758089 = fieldWeight in 1013, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1013)
        0.03761707 = weight(_text_:classification in 1013) [ClassicSimilarity], result of:
          0.03761707 = score(doc=1013,freq=10.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.39339557 = fieldWeight in 1013, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1013)
        0.03761707 = weight(_text_:classification in 1013) [ClassicSimilarity], result of:
          0.03761707 = score(doc=1013,freq=10.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.39339557 = fieldWeight in 1013, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1013)
      0.21428572 = coord(3/14)
    
    Abstract
    The paper proposes a new approach to create a patent classification system to replace the IPC or UPC system for conducting patent analysis and management. The new approach is based on co-citation analysis of bibliometrics. The traditional approach for management of patents, which is based on either the IPC or UPC, is too general to meet the needs of specific industries. In addition, some patents are placed in incorrect categories, making it difficult for enterprises to carry out R&D planning, technology positioning, patent strategy-making and technology forecasting. Therefore, it is essential to develop a patent classification system that is adaptive to the characteristics of a specific industry. The analysis of this approach is divided into three phases. Phase I selects appropriate databases to conduct patent searches according to the subject and objective of this study and then select basic patents. Phase II uses the co-cited frequency of the basic patent pairs to assess their similarity. Phase III uses factor analysis to establish a classification system and assess the efficiency of the proposed approach. The main contribution of this approach is to develop a patent classification system based on patent similarities to assist patent manager in understanding the basic patents for a specific industry, the relationships among categories of technologies and the evolution of a technology category.
  2. Leydesdorff, L.: Clusters and maps of science journals based on bi-connected graphs in Journal Citation Reports (2004) 0.02
    0.017691448 = product of:
      0.08256009 = sum of:
        0.02546139 = weight(_text_:subject in 4427) [ClassicSimilarity], result of:
          0.02546139 = score(doc=4427,freq=2.0), product of:
            0.10738805 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.03002521 = queryNorm
            0.23709705 = fieldWeight in 4427, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.046875 = fieldNorm(doc=4427)
        0.028549349 = weight(_text_:classification in 4427) [ClassicSimilarity], result of:
          0.028549349 = score(doc=4427,freq=4.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.29856625 = fieldWeight in 4427, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.046875 = fieldNorm(doc=4427)
        0.028549349 = weight(_text_:classification in 4427) [ClassicSimilarity], result of:
          0.028549349 = score(doc=4427,freq=4.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.29856625 = fieldWeight in 4427, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.046875 = fieldNorm(doc=4427)
      0.21428572 = coord(3/14)
    
    Abstract
    The aggregated journal-journal citation matrix derived from Journal Citation Reports 2001 can be decomposed into a unique subject classification using the graph-analytical algorithm of bi-connected components. This technique was recently incorporated in software tools for social network analysis. The matrix can be assessed in terms of its decomposability using articulation points which indicate overlap between the components. The articulation points of this set did not exhibit a next-order network of "general science" journals. However, the clusters differ in size and in terms of the internal density of their relations. A full classification of the journals is provided in the Appendix. The clusters can also be extracted and mapped for the visualization.
  3. Marion, L.S.; McCain, K.W.: Contrasting views of software engineering journals : author cocitation choices and indexer vocabulary assignments (2001) 0.01
    0.0121645145 = product of:
      0.0851516 = sum of:
        0.060013074 = weight(_text_:subject in 5767) [ClassicSimilarity], result of:
          0.060013074 = score(doc=5767,freq=16.0), product of:
            0.10738805 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.03002521 = queryNorm
            0.55884314 = fieldWeight in 5767, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5767)
        0.02513852 = weight(_text_:bibliographic in 5767) [ClassicSimilarity], result of:
          0.02513852 = score(doc=5767,freq=2.0), product of:
            0.11688946 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.03002521 = queryNorm
            0.21506234 = fieldWeight in 5767, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5767)
      0.14285715 = coord(2/14)
    
    Abstract
    We explore the intellectual subject structure and research themes in software engineering through the identification and analysis of a core journal literature. We examine this literature via two expert perspectives: that of the author, who identified significant work by citing it (journal cocitation analysis), and that of the professional indexer, who tags published work with subject terms to facilitate retrieval from a bibliographic database (subject profile analysis). The data sources are SCISEARCH (the on-line version of Science Citation Index), and INSPEC (a database covering software engineering, computer science, and information systems). We use data visualization tools (cluster analysis, multidimensional scaling, and PFNets) to show the "intellectual maps" of software engineering. Cocitation and subject profile analyses demonstrate that software engineering is a distinct interdisciplinary field, valuing practical and applied aspects, and spanning a subject continuum from "programming-in-the-smalI" to "programming-in-the-large." This continuum mirrors the software development life cycle by taking the operating system or major application from initial programming through project management, implementation, and maintenance. Object orientation is an integral but distinct subject area in software engineering. Key differences are the importance of management and programming: (1) cocitation analysis emphasizes project management and systems development; (2) programming techniques/languages are more influential in subject profiles; (3) cocitation profiles place object-oriented journals separately and centrally while the subject profile analysis locates these journals with the programming/languages group
  4. Rousseau, R.; Zuccala, A.: ¬A classification of author co-citations : definitions and search strategies (2004) 0.01
    0.00832516 = product of:
      0.058276117 = sum of:
        0.029138058 = weight(_text_:classification in 2266) [ClassicSimilarity], result of:
          0.029138058 = score(doc=2266,freq=6.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.3047229 = fieldWeight in 2266, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2266)
        0.029138058 = weight(_text_:classification in 2266) [ClassicSimilarity], result of:
          0.029138058 = score(doc=2266,freq=6.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.3047229 = fieldWeight in 2266, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2266)
      0.14285715 = coord(2/14)
    
    Abstract
    The term author co-citation is defined and classified according to four distinct forms: the pure first-author co-citation, the pure author co-citation, the general author co-citation, and the special co-authorlco-citation. Each form can be used to obtain one count in an author co-citation study, based an a binary counting rule, which either recognizes the co-citedness of two authors in a given reference list (1) or does not (0). Most studies using author co-citations have relied solely an first-author cocitation counts as evidence of an author's oeuvre or body of work contributed to a research field. In this article, we argue that an author's contribution to a selected field of study should not be limited, but should be based an his/her complete list of publications, regardless of author ranking. We discuss the implications associated with using each co-citation form and show where simple first-author co-citations fit within our classification scheme. Examples are given to substantiate each author co-citation form defined in our classification, including a set of sample Dialog(TM) searches using references extracted from the SciSearch database.
  5. Thelwall, M.; Harries, G.: ¬The connection between the research of a university and counts of links to its Web pages : an investigation based upon a classification of the relationships of pages to the research of the host university (2003) 0.01
    0.0067291465 = product of:
      0.047104023 = sum of:
        0.023552012 = weight(_text_:classification in 1676) [ClassicSimilarity], result of:
          0.023552012 = score(doc=1676,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.24630459 = fieldWeight in 1676, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1676)
        0.023552012 = weight(_text_:classification in 1676) [ClassicSimilarity], result of:
          0.023552012 = score(doc=1676,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.24630459 = fieldWeight in 1676, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1676)
      0.14285715 = coord(2/14)
    
  6. White, H.D.: Citation analysis : history (2009) 0.01
    0.0066223354 = product of:
      0.046356346 = sum of:
        0.021217827 = weight(_text_:subject in 3763) [ClassicSimilarity], result of:
          0.021217827 = score(doc=3763,freq=2.0), product of:
            0.10738805 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.03002521 = queryNorm
            0.19758089 = fieldWeight in 3763, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3763)
        0.02513852 = weight(_text_:bibliographic in 3763) [ClassicSimilarity], result of:
          0.02513852 = score(doc=3763,freq=2.0), product of:
            0.11688946 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.03002521 = queryNorm
            0.21506234 = fieldWeight in 3763, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3763)
      0.14285715 = coord(2/14)
    
    Abstract
    References from publications are at the same time citations to other publications. This entry introduces some of the practical uses of citation data in science and scholarship. At the individual level citations identify and permit the retrieval of specific editions of works, while also suggesting their subject matter, authority, and age. Through citation indexes, retrievals may include not only the earlier items referred to by a given work, but also the later items that cite that given work in turn. Some technical notes on retrieval are included here. Counts of citations received over time, and measures derived from them, reveal the varying impacts of works, authors, journals, organizations, and countries. This has obvious implications for the evaluation of, e.g., library collections, academics, research teams, and science policies. When treated as linkages between pairs of publications, references and citations reveal intellectual ties. Several kinds of links have been defined, such as cocitation, bibliographic coupling, and intercitation. In the aggregate, these links form networks that compactly suggest the intellectual histories of research specialties and disciplines, especially when the networks are visualized through mapping software. Citation analysis is of course not without critics, who have long pointed out imperfections in the data or in analytical techniques. However, the criticisms have generally been met by strong counterarguments from proponents.
  7. Leydesdorff, L.: Can scientific journals be classified in terms of aggregated journal-journal citation relations using the Journal Citation Reports? (2006) 0.01
    0.00576784 = product of:
      0.04037488 = sum of:
        0.02018744 = weight(_text_:classification in 5046) [ClassicSimilarity], result of:
          0.02018744 = score(doc=5046,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.21111822 = fieldWeight in 5046, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.046875 = fieldNorm(doc=5046)
        0.02018744 = weight(_text_:classification in 5046) [ClassicSimilarity], result of:
          0.02018744 = score(doc=5046,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.21111822 = fieldWeight in 5046, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.046875 = fieldNorm(doc=5046)
      0.14285715 = coord(2/14)
    
    Abstract
    The aggregated citation relations among journals included in the Science Citation Index provide us with a huge matrix, which can be analyzed in various ways. By using principal component analysis or factor analysis, the factor scores can be employed as indicators of the position of the cited journals in the citing dimensions of the database. Unrotated factor scores are exact, and the extraction of principal components can be made stepwise because the principal components are independent. Rotation may be needed for the designation, but in the rotated solution a model is assumed. This assumption can be legitimated on pragmatic or theoretical grounds. Because the resulting outcomes remain sensitive to the assumptions in the model, an unambiguous classification is no longer possible in this case. However, the factor-analytic solutions allow us to test classifications against the structures contained in the database; in this article the process will be demonstrated for the delineation of a set of biochemistry journals.
  8. White, H.D.: Authors as citers over time (2001) 0.01
    0.0055871224 = product of:
      0.039109856 = sum of:
        0.01899904 = product of:
          0.03799808 = sum of:
            0.03799808 = weight(_text_:schemes in 5581) [ClassicSimilarity], result of:
              0.03799808 = score(doc=5581,freq=2.0), product of:
                0.16067243 = queryWeight, product of:
                  5.3512506 = idf(docFreq=569, maxDocs=44218)
                  0.03002521 = queryNorm
                0.2364941 = fieldWeight in 5581, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.3512506 = idf(docFreq=569, maxDocs=44218)
                  0.03125 = fieldNorm(doc=5581)
          0.5 = coord(1/2)
        0.020110816 = weight(_text_:bibliographic in 5581) [ClassicSimilarity], result of:
          0.020110816 = score(doc=5581,freq=2.0), product of:
            0.11688946 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.03002521 = queryNorm
            0.17204987 = fieldWeight in 5581, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.03125 = fieldNorm(doc=5581)
      0.14285715 = coord(2/14)
    
    Abstract
    This study explores the tendency of authors to recite themselves and others in multiple works over time, using the insights gained to build citation theory. The set of all authors whom an author cites is defined as that author's citation identity. The study explains how to retrieve citation identities from the Institute for Scientific Information's files on Dialog and how to deal with idiosyncrasies of these files. As the author's oeuvre grows, the identity takes the form of a core-and-scatter distribution that may be divided into authors cited only once (unicitations) and authors cited at least twice (recitations). The latter group, especially those recited most frequently, are interpretable as symbols of a citer's main substantive concerns. As illustrated by the top recitees of eight information scientists, identities are intelligible, individualized, and wide-ranging. They are ego-centered without being egotistical. They are often affected by social ties between citers and citees, but the universal motivator seems to be the perceived relevance of the citees' works. Citing styles in identities differ: "scientific-paper style" authors recite heavily, adding to core; "bibliographic-essay style" authors are heavy on unicitations, adding to scatter; "literature-review style" authors do both at once. Identities distill aspects of citers' intellectual lives, such as orienting figures, interdisciplinary interests, bidisciplinary careers, and conduct in controversies. They can also be related to past schemes for classifying citations in categories such as positive-negative and perfunctory- organic; indeed, one author's frequent recitation of another, whether positive or negative, may be the readiest indicator of an organic relation between them. The shape of the core-and-scatter distribution of names in identities can be explained by the principle of least effort. Citers economize on effort by frequently reciting only a relatively small core of names in their identities. They also economize by frequent use of perfunctory citations, which require relatively little context, and infrequent use of negative citations, which require contexts more laborious to set
  9. Pudovkin, A.I.; Garfield, E.: Algorithmic procedure for finding semantically related journals (2002) 0.00
    0.004806533 = product of:
      0.03364573 = sum of:
        0.016822865 = weight(_text_:classification in 5220) [ClassicSimilarity], result of:
          0.016822865 = score(doc=5220,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.17593184 = fieldWeight in 5220, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5220)
        0.016822865 = weight(_text_:classification in 5220) [ClassicSimilarity], result of:
          0.016822865 = score(doc=5220,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.17593184 = fieldWeight in 5220, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5220)
      0.14285715 = coord(2/14)
    
    Abstract
    Journal Citation Reports provides a classification of journals most heavily cited by a given journal and which most heavily cite that journal, but size variation is not taken into account. Pudovkin and Garfield suggest a procedure for meeting this difficulty. The relatedness of journal i to journal j is determined by the number of citations from journal i to journal j in a given year normalized by the product of the papers published in the j journal in that year times the number of references cited in the i journal in that year. A multiplier of ten to the sixth is suggested to bring the values into an easily perceptible range. While citations received depend upon the overall cumulative number of papers published by a journal, the current year is utilized since that data is available in JCR. Citations to current year papers would be quite low in most fields and thus not included. To produce the final index, the maximum of the A citing B value, and the B citing A value is chosen and used to indicate the closeness of the journals. The procedure is illustrated for the journal Genetics.
  10. Sombatsompop, N.; Markpin, T.: Making an equality of ISI impact factors for different subject fields (2005) 0.00
    0.004066672 = product of:
      0.056933407 = sum of:
        0.056933407 = weight(_text_:subject in 3467) [ClassicSimilarity], result of:
          0.056933407 = score(doc=3467,freq=10.0), product of:
            0.10738805 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.03002521 = queryNorm
            0.5301652 = fieldWeight in 3467, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.046875 = fieldNorm(doc=3467)
      0.071428575 = coord(1/14)
    
    Abstract
    The journal impact factors, published by the Institute for Scientific Information (ISI; Philadelphia, PA), are widely known and are used to evaluate overall journal quality and the quality of the papers published therein. However, when making comparisons between subject fields, the work of individual scientists and their research institutions as reflected in their articles' ISI impact factors can become meaningless. This inequality will remain as long as ISI impact factors are employed as an instrument to assess the quality of international research. Here we propose a new mathematical index entitled Impact Factor PointAverage (IFPA) for assessment of the quality of individual research work in different subject fields. The index is established based an a normalization of differences in impact factors, rankings, and number of journal titles in different subject fields. The proposed index is simple and enables the ISI impact factors to be used with equality, especially when evaluating the quality of research work in different subject fields.
  11. Ahlgren, P.; Jarneving, B.; Rousseau, R.: Requirements for a cocitation similarity measure, with special reference to Pearson's correlation coefficient (2003) 0.00
    0.00403526 = product of:
      0.028246818 = sum of:
        0.020110816 = weight(_text_:bibliographic in 5171) [ClassicSimilarity], result of:
          0.020110816 = score(doc=5171,freq=2.0), product of:
            0.11688946 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.03002521 = queryNorm
            0.17204987 = fieldWeight in 5171, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.03125 = fieldNorm(doc=5171)
        0.008136002 = product of:
          0.016272005 = sum of:
            0.016272005 = weight(_text_:22 in 5171) [ClassicSimilarity], result of:
              0.016272005 = score(doc=5171,freq=2.0), product of:
                0.10514317 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03002521 = queryNorm
                0.15476047 = fieldWeight in 5171, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=5171)
          0.5 = coord(1/2)
      0.14285715 = coord(2/14)
    
    Abstract
    Ahlgren, Jarneving, and. Rousseau review accepted procedures for author co-citation analysis first pointing out that since in the raw data matrix the row and column values are identical i,e, the co-citation count of two authors, there is no clear choice for diagonal values. They suggest the number of times an author has been co-cited with himself excluding self citation rather than the common treatment as zeros or as missing values. When the matrix is converted to a similarity matrix the normal procedure is to create a matrix of Pearson's r coefficients between data vectors. Ranking by r and by co-citation frequency and by intuition can easily yield three different orders. It would seem necessary that the adding of zeros to the matrix will not affect the value or the relative order of similarity measures but it is shown that this is not the case with Pearson's r. Using 913 bibliographic descriptions form the Web of Science of articles form JASIS and Scientometrics, authors names were extracted, edited and 12 information retrieval authors and 12 bibliometric authors each from the top 100 most cited were selected. Co-citation and r value (diagonal elements treated as missing) matrices were constructed, and then reconstructed in expanded form. Adding zeros can both change the r value and the ordering of the authors based upon that value. A chi-squared distance measure would not violate these requirements, nor would the cosine coefficient. It is also argued that co-citation data is ordinal data since there is no assurance of an absolute zero number of co-citations, and thus Pearson is not appropriate. The number of ties in co-citation data make the use of the Spearman rank order coefficient problematic.
    Date
    9. 7.2006 10:22:35
  12. Gabel, J.: Improving information retrieval of subjects through citation-analysis (2006) 0.00
    0.0037123463 = product of:
      0.051972847 = sum of:
        0.051972847 = weight(_text_:subject in 61) [ClassicSimilarity], result of:
          0.051972847 = score(doc=61,freq=12.0), product of:
            0.10738805 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.03002521 = queryNorm
            0.48397237 = fieldWeight in 61, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.0390625 = fieldNorm(doc=61)
      0.071428575 = coord(1/14)
    
    Abstract
    Citation-chasing is proposed as a method of discovering additional terms to enhance subject-search retrieval by broadening and prioritizing the results. Subjects attached to records representing cited works are compared to subjects attached to records representing the original citing sources, and to the subjects yielded by chasing see-also references from the latter group of headings. Original citing sources were yielded via a subject-list search in a library catalog using the subject heading "Language and languages - Origin." A subject-search was employed to avoid subjectivity in choosing sources. References from the sources were searched in OCLC where applicable, and the subject headings were retrieved. The subjects were ranked first by number of citations from original sources, then by total citation-frequency. The results were tiered into 4 groups in a Bradford-like distribution. A similar rank and division was performed on the subjects representing the original citing sources, and those yielded by chasing see-also references. Both in terms of subject frequency and topic type, positive comparisons between citation chasing and see-also references show a confirmation of different methods of yielding alternative subjects. Exclusive results suggest potential mutual complementary value among these different methods.
  13. Cronin, B.: Semiotics and evaluative bibliometrics (2000) 0.00
    0.0035912173 = product of:
      0.05027704 = sum of:
        0.05027704 = weight(_text_:bibliographic in 4542) [ClassicSimilarity], result of:
          0.05027704 = score(doc=4542,freq=2.0), product of:
            0.11688946 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.03002521 = queryNorm
            0.43012467 = fieldWeight in 4542, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.078125 = fieldNorm(doc=4542)
      0.071428575 = coord(1/14)
    
    Abstract
    The reciprocal relationship between bibliographic references and citations in the context of the scholarly communication system is examined. Semiotic analysis of referencing behaviours and citation counting reveals the complexity of prevailing sign systems and associated symbolic practices.
  14. McVeigh, M.E.: Citation indexes and the Web of Science (2009) 0.00
    0.003047249 = product of:
      0.042661484 = sum of:
        0.042661484 = weight(_text_:bibliographic in 3848) [ClassicSimilarity], result of:
          0.042661484 = score(doc=3848,freq=4.0), product of:
            0.11688946 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.03002521 = queryNorm
            0.3649729 = fieldWeight in 3848, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.046875 = fieldNorm(doc=3848)
      0.071428575 = coord(1/14)
    
    Abstract
    The Web of Science, an online database of bibliographic information produced by Thomson Reuters- draws its real value from the scholarly citation index at its core. By indexing the cited references from each paper as a separate part of the bibliographic data, a citation index creates a pathway by which a paper can be linked backward in time to the body of work that preceded it, as well as linked forward in time to its scholarly descendants. This entry provides a brief history of the development of the citation index, its core functionalities, and the way these unique data are provided to users through the Web of Science.
  15. Gabel, J.: Improving information retrieval of subjects through citation-analysis : a study (2006) 0.00
    0.002625025 = product of:
      0.03675035 = sum of:
        0.03675035 = weight(_text_:subject in 225) [ClassicSimilarity], result of:
          0.03675035 = score(doc=225,freq=6.0), product of:
            0.10738805 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.03002521 = queryNorm
            0.34222013 = fieldWeight in 225, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.0390625 = fieldNorm(doc=225)
      0.071428575 = coord(1/14)
    
    Abstract
    Citation-chasing is proposed as a method of discovering additional terms to enhance subjectsearch retrieval. Subjects attached to OCLC records for cited works are compared to those attached to original citing sources. Citing sources were produced via a subject-list search in a library catalog using the LCSH "Language and languages-Origin." A subject-search was employed to avoid subjectivity in choosing sources. References from the sources were searched in OCLC where applicable, and the subject headings were retrieved. The subjects were ranked by citation-frequency and tiered into 3 groups in a Bradford-like distribution. Highly cited subjects were produced that were not revealed through the original search. A difference in relative importance among the subjects was also revealed. Broad extra-linguistic topics like evolution are more prominent than specific linguistic topics like phonology. There are exceptions, which appear somewhat predictable by the amount of imbalance in citation-representation among the 2 sources. Citation leaders were also produced for authors and secondary-source titles.
  16. Lin, X.; White, H.D.; Buzydlowski, J.: Real-time author co-citation mapping for online searching (2003) 0.00
    0.0025719889 = product of:
      0.036007844 = sum of:
        0.036007844 = weight(_text_:subject in 1080) [ClassicSimilarity], result of:
          0.036007844 = score(doc=1080,freq=4.0), product of:
            0.10738805 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.03002521 = queryNorm
            0.33530587 = fieldWeight in 1080, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.046875 = fieldNorm(doc=1080)
      0.071428575 = coord(1/14)
    
    Abstract
    Author searching is traditionally based on the matching of name strings. Special characteristics of authors as personal names and subject indicators are not considered. This makes it difficult to identify a set of related authors or to group authors by subjects in retrieval systems. In this paper, we describe the design and implementation of a prototype visualization system to enhance author searching. The system, called AuthorLink, is based on author co-citation analysis and visualization mapping algorithms such as Kohonen's feature maps and Pathfinder networks. AuthorLink produces interactive author maps in real time from a database of 1.26 million records supplied by the Institute for Scientific Information. The maps show subject groupings and more fine-grained intellectual connections among authors. Through the interactive interface the user can take advantage of such information to refine queries and retrieve documents through point-and-click manipulation of the authors' names.
  17. Vaughan, L.; Shaw , D.: Bibliographic and Web citations : what Is the difference? (2003) 0.00
    0.002539374 = product of:
      0.035551235 = sum of:
        0.035551235 = weight(_text_:bibliographic in 5176) [ClassicSimilarity], result of:
          0.035551235 = score(doc=5176,freq=4.0), product of:
            0.11688946 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.03002521 = queryNorm
            0.30414405 = fieldWeight in 5176, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5176)
      0.071428575 = coord(1/14)
    
    Abstract
    Vaughn, and Shaw look at the relationship between traditional citation and Web citation (not hyperlinks but rather textual mentions of published papers). Using English language research journals in ISI's 2000 Journal Citation Report - Information and Library Science category - 1209 full length papers published in 1997 in 46 journals were identified. Each was searched in Social Science Citation Index and on the Web using Google phrase search by entering the title in quotation marks, and followed for distinction where necessary with sub-titles, author's names, and journal title words. After removing obvious false drops, the number of web sites was recorded for comparison with the SSCI counts. A second sample from 1992 was also collected for examination. There were a total of 16,371 web citations to the selected papers. The top and bottom ranked four journals were then examined and every third citation to every third paper was selected and classified as to source type, domain, and country of origin. Web counts are much higher than ISI citation counts. Of the 46 journals from 1997, 26 demonstrated a significant correlation between Web and traditional citation counts, and 11 of the 15 in the 1992 sample also showed significant correlation. Journal impact factor in 1998 and 1999 correlated significantly with average Web citations per journal in the 1997 data, but at a low level. Thirty percent of web citations come from other papers posted on the web, and 30percent from listings of web based bibliographic services, while twelve percent come from class reading lists. High web citation journals often have web accessible tables of content.
  18. Leydesdorff, L.: Dynamic and evolutionary updates of classificatory schemes in scientific journal structures (2002) 0.00
    0.00237488 = product of:
      0.03324832 = sum of:
        0.03324832 = product of:
          0.06649664 = sum of:
            0.06649664 = weight(_text_:schemes in 1249) [ClassicSimilarity], result of:
              0.06649664 = score(doc=1249,freq=2.0), product of:
                0.16067243 = queryWeight, product of:
                  5.3512506 = idf(docFreq=569, maxDocs=44218)
                  0.03002521 = queryNorm
                0.41386467 = fieldWeight in 1249, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.3512506 = idf(docFreq=569, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1249)
          0.5 = coord(1/2)
      0.071428575 = coord(1/14)
    
  19. Nicolaisen, J.: Citation analysis (2007) 0.00
    0.0023245723 = product of:
      0.03254401 = sum of:
        0.03254401 = product of:
          0.06508802 = sum of:
            0.06508802 = weight(_text_:22 in 6091) [ClassicSimilarity], result of:
              0.06508802 = score(doc=6091,freq=2.0), product of:
                0.10514317 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03002521 = queryNorm
                0.61904186 = fieldWeight in 6091, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.125 = fieldNorm(doc=6091)
          0.5 = coord(1/2)
      0.071428575 = coord(1/14)
    
    Date
    13. 7.2008 19:53:22
  20. Marshakova-Shaikevich, I.: Bibliometric maps of field of science (2005) 0.00
    0.0021365185 = product of:
      0.029911257 = sum of:
        0.029911257 = product of:
          0.059822515 = sum of:
            0.059822515 = weight(_text_:texts in 1069) [ClassicSimilarity], result of:
              0.059822515 = score(doc=1069,freq=2.0), product of:
                0.16460659 = queryWeight, product of:
                  5.4822793 = idf(docFreq=499, maxDocs=44218)
                  0.03002521 = queryNorm
                0.36342722 = fieldWeight in 1069, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.4822793 = idf(docFreq=499, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1069)
          0.5 = coord(1/2)
      0.071428575 = coord(1/14)
    
    Abstract
    The present paper is devoted to two directions in algorithmic classificatory procedures: the journal co-citation analysis as an example of citation networks and lexical analysis of keywords in the titles and texts. What is common to those approaches is the general idea of normalization of deviations of the observed data from the mathematical expectation. The application of the same formula leads to discovery of statistically significant links between objects (journals in one case, keywords - in the other). The results of the journal co-citation analysis are reflected in tables and map for field "Women's Studies" and for field "Information Science and Library Science". An experimental attempt at establishing textual links between words was carried out on two samples from SSCI Data base: (1) EDUCATION and (2) ETHICS. The EDUCATION file included 2180 documents (of which 751 had abstracts); the ETHICS file included 807 documents (289 abstracts). Some examples of the results of this pilot study are given in tabular form . The binary links between words discovered in this way may form triplets or other groups with more than two member words.