Search (117 results, page 1 of 6)

  • × year_i:[2000 TO 2010}
  • × theme_ss:"Informetrie"
  1. Grazia Colonia; Dimmler, E.; Dresel, R.; Messner, C.; Krobath, A.; Petz, S.; Sypien, M.; Boxen, P. van; Harders, M.; Heuer, D.; Jordans, I.; Juchem, K.; Linnertz, M.; Mittelhuber, I.; Schwammel, S.; Schlögl, C.; Stock, W.G.: Informationswissenschaftliche Zeitschriften in szientometrischer Analyse (2002) 0.04
    0.042756718 = product of:
      0.085513435 = sum of:
        0.085513435 = product of:
          0.12827015 = sum of:
            0.069849186 = weight(_text_:i in 1075) [ClassicSimilarity], result of:
              0.069849186 = score(doc=1075,freq=4.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.41253293 = fieldWeight in 1075, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1075)
            0.058420956 = weight(_text_:c in 1075) [ClassicSimilarity], result of:
              0.058420956 = score(doc=1075,freq=4.0), product of:
                0.15484828 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.044891298 = queryNorm
                0.3772787 = fieldWeight in 1075, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1075)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  2. Samoylenko, I.; Chao, T.-C.; Liu, W.-C.; Chen, C.-M.: Visualizing the scientific world and its evolution (2006) 0.03
    0.034554742 = product of:
      0.069109485 = sum of:
        0.069109485 = product of:
          0.10366423 = sum of:
            0.042335 = weight(_text_:i in 5911) [ClassicSimilarity], result of:
              0.042335 = score(doc=5911,freq=2.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.25003272 = fieldWeight in 5911, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5911)
            0.061329227 = weight(_text_:c in 5911) [ClassicSimilarity], result of:
              0.061329227 = score(doc=5911,freq=6.0), product of:
                0.15484828 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.044891298 = queryNorm
                0.3960601 = fieldWeight in 5911, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5911)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  3. He, Z.-L.: International collaboration does not have greater epistemic authority (2009) 0.03
    0.03212121 = product of:
      0.06424242 = sum of:
        0.06424242 = product of:
          0.096363634 = sum of:
            0.05987073 = weight(_text_:i in 3122) [ClassicSimilarity], result of:
              0.05987073 = score(doc=3122,freq=4.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.35359967 = fieldWeight in 3122, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3122)
            0.036492907 = weight(_text_:22 in 3122) [ClassicSimilarity], result of:
              0.036492907 = score(doc=3122,freq=2.0), product of:
                0.15720168 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044891298 = queryNorm
                0.23214069 = fieldWeight in 3122, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3122)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Abstract
    The consistent finding that internationally coauthored papers are more heavily cited has led to a tacit agreement among politicians and scientists that international collaboration in scientific research should be particularly promoted. However, existing studies of research collaboration suffer from a major weakness in that the Thomson Reuters Web of Science until recently did not link author names with affiliation addresses. The general approach has been to hierarchically code papers into international paper, national paper, or local paper based on the address information. This hierarchical coding scheme severely understates the level and contribution of local or national collaboration on an internationally coauthored paper. In this research, I code collaboration variables by hand checking each paper in the sample, use two measures of a paper's impact, and try several regression models. I find that both international collaboration and local collaboration are positively and significantly associated with a paper's impact, but international collaboration does not have more epistemic authority than local collaboration. This result suggests that previous findings based on hierarchical coding might be misleading.
    Date
    26. 9.2009 11:22:05
  4. He, Q.: ¬A study of the strength indexes in co-word analysis (2000) 0.03
    0.031759724 = product of:
      0.06351945 = sum of:
        0.06351945 = product of:
          0.09527917 = sum of:
            0.05987073 = weight(_text_:i in 111) [ClassicSimilarity], result of:
              0.05987073 = score(doc=111,freq=4.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.35359967 = fieldWeight in 111, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.046875 = fieldNorm(doc=111)
            0.035408445 = weight(_text_:c in 111) [ClassicSimilarity], result of:
              0.035408445 = score(doc=111,freq=2.0), product of:
                0.15484828 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.044891298 = queryNorm
                0.22866541 = fieldWeight in 111, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.046875 = fieldNorm(doc=111)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Abstract
    Co-word analysis is a technique for detecting the knowledge structure of scientific literature and mapping the dynamics in a research field. It is used to count the co-occurrences of term pairs, compute the strength between term pairs, and map the research field by inserting terms and their linkages into a graphical structure according to the strength values. In previous co-word studies, there are two indexes used to measure the strength between term pairs in order to identify the major areas in a research field - the inclusion index (I) and the equivalence index (E). This study will conduct two co-word analysis experiments using the two indexes, respectively, and compare the results from the two experiments. The results show, due to the difference in their computation, index I is more likely to identify general subject areas in a research field while index E is more likely to identify subject areas at more specific levels
    Source
    Dynamism and stability in knowledge organization: Proceedings of the 6th International ISKO-Conference, 10-13 July 2000, Toronto, Canada. Ed.: C. Beghtol et al
  5. Chen, C.: CiteSpace II : detecting and visualizing emerging trends and transient patterns in scientific literature (2006) 0.02
    0.024046673 = product of:
      0.048093345 = sum of:
        0.048093345 = product of:
          0.072140016 = sum of:
            0.041729257 = weight(_text_:c in 5272) [ClassicSimilarity], result of:
              0.041729257 = score(doc=5272,freq=4.0), product of:
                0.15484828 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.044891298 = queryNorm
                0.2694848 = fieldWeight in 5272, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5272)
            0.030410757 = weight(_text_:22 in 5272) [ClassicSimilarity], result of:
              0.030410757 = score(doc=5272,freq=2.0), product of:
                0.15720168 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044891298 = queryNorm
                0.19345059 = fieldWeight in 5272, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5272)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Abstract
    This article describes the latest development of a generic approach to detecting and visualizing emerging trends and transient patterns in scientific literature. The work makes substantial theoretical and methodological contributions to progressive knowledge domain visualization. A specialty is conceptualized and visualized as a time-variant duality between two fundamental concepts in information science: research fronts and intellectual bases. A research front is defined as an emergent and transient grouping of concepts and underlying research issues. The intellectual base of a research front is its citation and co-citation footprint in scientific literature - an evolving network of scientific publications cited by research-front concepts. Kleinberg's (2002) burst-detection algorithm is adapted to identify emergent research-front concepts. Freeman's (1979) betweenness centrality metric is used to highlight potential pivotal points of paradigm shift over time. Two complementary visualization views are designed and implemented: cluster views and time-zone views. The contributions of the approach are that (a) the nature of an intellectual base is algorithmically and temporally identified by emergent research-front terms, (b) the value of a co-citation cluster is explicitly interpreted in terms of research-front concepts, and (c) visually prominent and algorithmically detected pivotal points substantially reduce the complexity of a visualized network. The modeling and visualization process is implemented in CiteSpace II, a Java application, and applied to the analysis of two research fields: mass extinction (1981-2004) and terrorism (1990-2003). Prominent trends and pivotal points in visualized networks were verified in collaboration with domain experts, who are the authors of pivotal-point articles. Practical implications of the work are discussed. A number of challenges and opportunities for future studies are identified.
    Date
    22. 7.2006 16:11:05
  6. Chen, C.-M.: Classification of scientific networks using aggregated journal-journal citation relations in the Journal Citation Reports (2008) 0.02
    0.021595402 = product of:
      0.043190803 = sum of:
        0.043190803 = product of:
          0.0647862 = sum of:
            0.035279166 = weight(_text_:i in 2690) [ClassicSimilarity], result of:
              0.035279166 = score(doc=2690,freq=2.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.20836058 = fieldWeight in 2690, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2690)
            0.029507035 = weight(_text_:c in 2690) [ClassicSimilarity], result of:
              0.029507035 = score(doc=2690,freq=2.0), product of:
                0.15484828 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.044891298 = queryNorm
                0.1905545 = fieldWeight in 2690, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2690)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Abstract
    I propose an approach to classifying scientific networks in terms of aggregated journal-journal citation relations of the ISI Journal Citation Reports using the affinity propagation method. This algorithm is applied to obtain the classification of SCI and SSCI journals by minimizing intracategory journal-journal (J-J) distances in the database, where distance between journals is calculated from the similarity of their annual citation patterns with a cutoff parameter, t, to restrain the maximal J-J distance. As demonstrated in the classification of SCI journals, classification of scientific networks with different resolution is possible by choosing proper values of t. Twenty journal categories in SCI are found to be stable despite a difference of an order of magnitude in t. In our classifications, the level of specificity of a category can be found by looking at its value of RJ (the average distance of members of a category to its representative journal), and relatedness of category members is implied by the value of DJ-J (the average DJ-J distance within a category). Our results are consistent with the ISI classification scheme, and the level of relatedness for most categories in our classification is higher than their counterpart in the ISI classification scheme.
  7. Ahlgren, P.; Jarneving, B.; Rousseau, R.: Requirements for a cocitation similarity measure, with special reference to Pearson's correlation coefficient (2003) 0.02
    0.017517313 = product of:
      0.035034627 = sum of:
        0.035034627 = product of:
          0.052551936 = sum of:
            0.028223332 = weight(_text_:i in 5171) [ClassicSimilarity], result of:
              0.028223332 = score(doc=5171,freq=2.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.16668847 = fieldWeight in 5171, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.03125 = fieldNorm(doc=5171)
            0.024328604 = weight(_text_:22 in 5171) [ClassicSimilarity], result of:
              0.024328604 = score(doc=5171,freq=2.0), product of:
                0.15720168 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044891298 = queryNorm
                0.15476047 = fieldWeight in 5171, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=5171)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Abstract
    Ahlgren, Jarneving, and. Rousseau review accepted procedures for author co-citation analysis first pointing out that since in the raw data matrix the row and column values are identical i,e, the co-citation count of two authors, there is no clear choice for diagonal values. They suggest the number of times an author has been co-cited with himself excluding self citation rather than the common treatment as zeros or as missing values. When the matrix is converted to a similarity matrix the normal procedure is to create a matrix of Pearson's r coefficients between data vectors. Ranking by r and by co-citation frequency and by intuition can easily yield three different orders. It would seem necessary that the adding of zeros to the matrix will not affect the value or the relative order of similarity measures but it is shown that this is not the case with Pearson's r. Using 913 bibliographic descriptions form the Web of Science of articles form JASIS and Scientometrics, authors names were extracted, edited and 12 information retrieval authors and 12 bibliometric authors each from the top 100 most cited were selected. Co-citation and r value (diagonal elements treated as missing) matrices were constructed, and then reconstructed in expanded form. Adding zeros can both change the r value and the ordering of the authors based upon that value. A chi-squared distance measure would not violate these requirements, nor would the cosine coefficient. It is also argued that co-citation data is ordinal data since there is no assurance of an absolute zero number of co-citations, and thus Pearson is not appropriate. The number of ties in co-citation data make the use of the Spearman rank order coefficient problematic.
    Date
    9. 7.2006 10:22:35
  8. Nicolaisen, J.: Citation analysis (2007) 0.02
    0.01621907 = product of:
      0.03243814 = sum of:
        0.03243814 = product of:
          0.09731442 = sum of:
            0.09731442 = weight(_text_:22 in 6091) [ClassicSimilarity], result of:
              0.09731442 = score(doc=6091,freq=2.0), product of:
                0.15720168 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044891298 = queryNorm
                0.61904186 = fieldWeight in 6091, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.125 = fieldNorm(doc=6091)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    13. 7.2008 19:53:22
  9. Tonta, Y.; Ünal, Y.: Scatter of journals and literature obsolescence reflected in document delivery requests (2005) 0.02
    0.01597808 = product of:
      0.03195616 = sum of:
        0.03195616 = product of:
          0.047934234 = sum of:
            0.02360563 = weight(_text_:c in 3271) [ClassicSimilarity], result of:
              0.02360563 = score(doc=3271,freq=2.0), product of:
                0.15484828 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.044891298 = queryNorm
                0.1524436 = fieldWeight in 3271, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.03125 = fieldNorm(doc=3271)
            0.024328604 = weight(_text_:22 in 3271) [ClassicSimilarity], result of:
              0.024328604 = score(doc=3271,freq=2.0), product of:
                0.15720168 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044891298 = queryNorm
                0.15476047 = fieldWeight in 3271, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=3271)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Abstract
    In this paper we investigate the scattering of journals and literature obsolescence reflected in more than 137,000 document delivery requests submitted to a national document delivery service. We first summarize the major findings of the study with regards to the performance of the service. We then identify the "core" journals from which article requests were satisfied and address the following research questions: (a) Does the distribution of (core) journals conform to the Bradford's Law of Scattering? (b) Is there a relationship between usage of journals and impact factors, journals with high impact factors being used more often than the rest? (c) Is there a relationship between usage of journals and total citation counts, journals with high total citation counts being used more often than the rest? (d) What is the median age of use (half-life) of requested articles in general? (e) Do requested articles that appear in core journals get obsolete more slowly? (f) Is there a relationship between obsolescence and journal impact factors, journals with high impact factors being obsolete more slowly? (g) Is there a relationship between obsolescence and total citation counts, journals with high total citation counts being obsolete more slowly? Based an the analysis of findings, we found that the distribution of highly and moderately used journal titles conform to Bradford's Law. The median age of use was 8 years for all requested articles. Ninety percent of the articles requested were 21 years of age or younger. Articles that appeared in 168 core journal titles seem to get obsolete slightly more slowly than those of all titles. We observed no statistically significant correlations between the frequency of journal use and ISI journal impact factors, and between the frequency of journal use and ISI- (Institute for Scientific Information, Philadelphia, PA) cited half-lives for the most heavily used 168 core journal titles. There was a weak correlation between usage of journals and ISI-reported total citation counts. No statistically significant relationship was found between median age of use and journal impact factors and between median age of use and total citation counts. There was a weak negative correlation between ISI journal impact factors and cited half-lives of 168 core journals, and a weak correlation between ISI citation halflives and use half-lives of core journals. No correlation was found between cited half-lives of 168 core journals and their corresponding total citation counts as reported by ISI. Findings of the current study are discussed along with those of other studies.
    Date
    20. 3.2005 10:54:22
  10. Van der Veer Martens, B.: Do citation systems represent theories of truth? (2001) 0.01
    0.014335768 = product of:
      0.028671537 = sum of:
        0.028671537 = product of:
          0.086014606 = sum of:
            0.086014606 = weight(_text_:22 in 3925) [ClassicSimilarity], result of:
              0.086014606 = score(doc=3925,freq=4.0), product of:
                0.15720168 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044891298 = queryNorm
                0.54716086 = fieldWeight in 3925, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=3925)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    22. 7.2006 15:22:28
  11. Liu, Z.; Wang, C.: Mapping interdisciplinarity in demography : a journal network analysis (2005) 0.01
    0.01376995 = product of:
      0.0275399 = sum of:
        0.0275399 = product of:
          0.0826197 = sum of:
            0.0826197 = weight(_text_:c in 4384) [ClassicSimilarity], result of:
              0.0826197 = score(doc=4384,freq=2.0), product of:
                0.15484828 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.044891298 = queryNorm
                0.5335526 = fieldWeight in 4384, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4384)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  12. Lewison, G.: ¬The work of the Bibliometrics Research Group (City University) and associates (2005) 0.01
    0.012164302 = product of:
      0.024328604 = sum of:
        0.024328604 = product of:
          0.07298581 = sum of:
            0.07298581 = weight(_text_:22 in 4890) [ClassicSimilarity], result of:
              0.07298581 = score(doc=4890,freq=2.0), product of:
                0.15720168 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044891298 = queryNorm
                0.46428138 = fieldWeight in 4890, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=4890)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    20. 1.2007 17:02:22
  13. Pulgarin, A.; Gil-Leiva, I.: Bibliometric analysis of the automatic indexing literature : 1956-2000 (2004) 0.01
    0.011641531 = product of:
      0.023283063 = sum of:
        0.023283063 = product of:
          0.069849186 = sum of:
            0.069849186 = weight(_text_:i in 2566) [ClassicSimilarity], result of:
              0.069849186 = score(doc=2566,freq=4.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.41253293 = fieldWeight in 2566, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2566)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    We present a bibliometric study of a corpus of 839 bibliographic references about automatic indexing, covering the period 1956-2000. We analyse the distribution of authors and works, the obsolescence and its dispersion, and the distribution of the literature by topic, year, and source type. We conclude that: (i) there has been a constant interest on the part of researchers; (ii) the most studied topics were the techniques and methods employed and the general aspects of automatic indexing; (iii) the productivity of the authors does fit a Lotka distribution (Dmax=0.02 and critical value=0.054); (iv) the annual aging factor is 95%; and (v) the dispersion of the literature is low.
  14. Cothey, V.: Web-crawling reliability (2004) 0.01
    0.011641531 = product of:
      0.023283063 = sum of:
        0.023283063 = product of:
          0.069849186 = sum of:
            0.069849186 = weight(_text_:i in 3089) [ClassicSimilarity], result of:
              0.069849186 = score(doc=3089,freq=4.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.41253293 = fieldWeight in 3089, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3089)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    In this article, I investigate the reliability, in the social science sense, of collecting informetric data about the World Wide Web by Web crawling. The investigation includes a critical examination of the practice of Web crawling and contrasts the results of content crawling with the results of link crawling. It is shown that Web crawling by search engines is intentionally biased and selective. I also report the results of a [arge-scale experimental simulation of Web crawling that illustrates the effects of different crawling policies an data collection. It is concluded that the reliability of Web crawling as a data collection technique is improved by fuller reporting of relevant crawling policies.
  15. Gaulé, P.: Access to scientific literature in India (2009) 0.01
    0.011641531 = product of:
      0.023283063 = sum of:
        0.023283063 = product of:
          0.069849186 = sum of:
            0.069849186 = weight(_text_:i in 3108) [ClassicSimilarity], result of:
              0.069849186 = score(doc=3108,freq=4.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.41253293 = fieldWeight in 3108, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3108)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    This article uses an evidence-based approach to assess the difficulties faced by developing country scientists in accessing scientific literature. I compare the backward citation patterns of Swiss and Indian scientists in a database of 43,150 scientific papers published by scientists from either country in 2007. Controlling for fields and quality with citing journal fixed effects, I find that Indian scientists have shorter reference lists (-6%) and are more likely to cite articles from open access journals (+50%). Moreover, the difference in the length of the reference list is more pronounced in biology and medicine, where circulation of (free) preprints and conference proceedings is non-existent. Informal file-sharing practices among scientists mitigate the effects of access restrictions.
  16. White, H.D.: Author cocitation analysis and pearson's r (2003) 0.01
    0.010184219 = product of:
      0.020368438 = sum of:
        0.020368438 = product of:
          0.061105315 = sum of:
            0.061105315 = weight(_text_:i in 2119) [ClassicSimilarity], result of:
              0.061105315 = score(doc=2119,freq=6.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.36089116 = fieldWeight in 2119, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2119)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    In their article "Requirements for a cocitation similarity measure, with special reference to Pearson's correlation coefficient," Ahlgren, Jarneving, and Rousseau fault traditional author cocitation analysis (ACA) for using Pearson's r as a measure of similarity between authors because it fails two tests of stability of measurement. The instabilities arise when rs are recalculated after a first coherent group of authors has been augmented by a second coherent group with whom the first has little or no cocitation. However, AJ&R neither cluster nor map their data to demonstrate how fluctuations in rs will mislead the analyst, and the problem they pose is remote from both theory and practice in traditional ACA. By entering their own rs into multidimensional scaling and clustering routines, I show that, despite r's fluctuations, clusters based an it are much the same for the combined groups as for the separate groups. The combined groups when mapped appear as polarized clumps of points in two-dimensional space, confirming that differences between the groups have become much more important than differences within the groups-an accurate portrayal of what has happened to the data. Moreover, r produces clusters and maps very like those based an other coefficients that AJ&R mention as possible replacements, such as a cosine similarity measure or a chi square dissimilarity measure. Thus, r performs well enough for the purposes of ACA. Accordingly, I argue that qualitative information revealing why authors are cocited is more important than the cautions proposed in the AJ&R critique. I include notes an topics such as handling the diagonal in author cocitation matrices, lognormalizing data, and testing r for significance.
  17. Pudovkin, A.I.; Garfield, E.: Algorithmic procedure for finding semantically related journals (2002) 0.01
    0.010184219 = product of:
      0.020368438 = sum of:
        0.020368438 = product of:
          0.061105315 = sum of:
            0.061105315 = weight(_text_:i in 5220) [ClassicSimilarity], result of:
              0.061105315 = score(doc=5220,freq=6.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.36089116 = fieldWeight in 5220, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5220)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    Journal Citation Reports provides a classification of journals most heavily cited by a given journal and which most heavily cite that journal, but size variation is not taken into account. Pudovkin and Garfield suggest a procedure for meeting this difficulty. The relatedness of journal i to journal j is determined by the number of citations from journal i to journal j in a given year normalized by the product of the papers published in the j journal in that year times the number of references cited in the i journal in that year. A multiplier of ten to the sixth is suggested to bring the values into an easily perceptible range. While citations received depend upon the overall cumulative number of papers published by a journal, the current year is utilized since that data is available in JCR. Citations to current year papers would be quite low in most fields and thus not included. To produce the final index, the maximum of the A citing B value, and the B citing A value is chosen and used to indicate the closeness of the journals. The procedure is illustrated for the journal Genetics.
  18. Schreiber, M.: ¬An empirical investigation of the g-index for 26 physicists in comparison with the h-index, the A-index, and the R-index (2008) 0.01
    0.010184219 = product of:
      0.020368438 = sum of:
        0.020368438 = product of:
          0.061105315 = sum of:
            0.061105315 = weight(_text_:i in 1968) [ClassicSimilarity], result of:
              0.061105315 = score(doc=1968,freq=6.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.36089116 = fieldWeight in 1968, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1968)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    J.E. Hirsch (2005) introduced the h-index to quantify an individual's scientific research output by the largest number h of a scientist's papers that received at least h citations. To take into account the highly skewed frequency distribution of citations, L. Egghe (2006a) proposed the g-index as an improvement of the h-index. I have worked out 26 practical cases of physicists from the Institute of Physics at Chemnitz University of Technology, and compare the h and g values in this study. It is demonstrated that the g-index discriminates better between different citation patterns. This also can be achieved by evaluating B.H. Jin's (2006) A-index, which reflects the average number of citations in the h-core, and interpreting it in conjunction with the h-index. h and A can be combined into the R-index to measure the h-core's citation intensity. I also have determined the A and R values for the 26 datasets. For a better comparison, I utilize interpolated indices. The correlations between the various indices as well as with the total number of papers and the highest citation counts are discussed. The largest Pearson correlation coefficient is found between g and R. Although the correlation between g and h is relatively strong, the arrangement of the datasets is significantly different depending on whether they are put into order according to the values of either h or g.
  19. Rowlands, I.: Journal diffusion factors : a new approach to measuring research influence (2002) 0.01
    0.009407777 = product of:
      0.018815555 = sum of:
        0.018815555 = product of:
          0.056446664 = sum of:
            0.056446664 = weight(_text_:i in 693) [ClassicSimilarity], result of:
              0.056446664 = score(doc=693,freq=2.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.33337694 = fieldWeight in 693, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0625 = fieldNorm(doc=693)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  20. Bookstein, A.; Moed, H.; Yitzahki, M.: Measures of international collaboration in scientific literature : part I (2006) 0.01
    0.009407777 = product of:
      0.018815555 = sum of:
        0.018815555 = product of:
          0.056446664 = sum of:
            0.056446664 = weight(_text_:i in 985) [ClassicSimilarity], result of:
              0.056446664 = score(doc=985,freq=2.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.33337694 = fieldWeight in 985, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0625 = fieldNorm(doc=985)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    

Authors

Languages

  • e 102
  • d 15

Types

  • a 114
  • el 1
  • m 1
  • r 1
  • x 1
  • More… Less…