Search (9 results, page 1 of 1)

Vaughan, L.; Shaw , D.: Bibliographic and Web citations : what Is the difference? (2003) 0.06
```
0.06401929 = product of:
  0.19205788 = sum of:
    0.19205788 = weight(_text_:citation in 5176) [ClassicSimilarity], result of:
      0.19205788 = score(doc=5176,freq=20.0), product of:
        0.23445003 = queryWeight, product of:
          4.6892867 = idf(docFreq=1104, maxDocs=44218)
          0.04999695 = queryNorm
        0.8191847 = fieldWeight in 5176, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          4.6892867 = idf(docFreq=1104, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5176)
  0.33333334 = coord(1/3)
```
Abstract

Vaughn, and Shaw look at the relationship between traditional citation and Web citation (not hyperlinks but rather textual mentions of published papers). Using English language research journals in ISI's 2000 Journal Citation Report - Information and Library Science category - 1209 full length papers published in 1997 in 46 journals were identified. Each was searched in Social Science Citation Index and on the Web using Google phrase search by entering the title in quotation marks, and followed for distinction where necessary with sub-titles, author's names, and journal title words. After removing obvious false drops, the number of web sites was recorded for comparison with the SSCI counts. A second sample from 1992 was also collected for examination. There were a total of 16,371 web citations to the selected papers. The top and bottom ranked four journals were then examined and every third citation to every third paper was selected and classified as to source type, domain, and country of origin. Web counts are much higher than ISI citation counts. Of the 46 journals from 1997, 26 demonstrated a significant correlation between Web and traditional citation counts, and 11 of the 15 in the 1992 sample also showed significant correlation. Journal impact factor in 1998 and 1999 correlated significantly with average Web citations per journal in the 1997 data, but at a low level. Thirty percent of web citations come from other papers posted on the web, and 30percent from listings of web based bibliographic services, while twelve percent come from class reading lists. High web citation journals often have web accessible tables of content.

Object

Journal Citation Report

Theme

Citation indexing
Vaughan, L.; Shaw, D.: Web citation data for impact assessment : a comparison of four science disciplines (2005) 0.05
```
0.049589127 = product of:
  0.14876738 = sum of:
    0.14876738 = weight(_text_:citation in 3880) [ClassicSimilarity], result of:
      0.14876738 = score(doc=3880,freq=12.0), product of:
        0.23445003 = queryWeight, product of:
          4.6892867 = idf(docFreq=1104, maxDocs=44218)
          0.04999695 = queryNorm
        0.6345377 = fieldWeight in 3880, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          4.6892867 = idf(docFreq=1104, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3880)
  0.33333334 = coord(1/3)
```
Abstract

The number and type of Web citations to journal articles in four areas of science are examined: biology, genetics, medicine, and multidisciplinary sciences. For a sample of 5,972 articles published in 114 journals, the median Web citation counts per journal article range from 6.2 in medicine to 10.4 in genetics. About 30% of Web citations in each area indicate intellectual impact (citations from articles or class readings, in contrast to citations from bibliographic services or the author's or journal's home page). Journals receiving more Web citations also have higher percentages of citations indicating intellectual impact. There is significant correlation between the number of citations reported in the databases from the Institute for Scientific Information (ISI, now Thomson Scientific) and the number of citations retrieved using the Google search engine (Web citations). The correlation is much weaker for journals published outside the United Kingdom or United States and for multidisciplinary journals. Web citation numbers are higher than ISI citation counts, suggesting that Web searches might be conducted for an earlier or a more fine-grained assessment of an article's impact. The Web-evident impact of non-UK/USA publications might provide a balance to the geographic or cultural biases observed in ISI's data, although the stability of Web citation counts is debatable.

Theme

Citation indexing
Leydesdorff, L.; Vaughan, L.: Co-occurrence matrices and their applications in information science : extending ACA to the Web environment (2006) 0.04
```
0.035064813 = product of:
  0.105194435 = sum of:
    0.105194435 = weight(_text_:citation in 6113) [ClassicSimilarity], result of:
      0.105194435 = score(doc=6113,freq=6.0), product of:
        0.23445003 = queryWeight, product of:
          4.6892867 = idf(docFreq=1104, maxDocs=44218)
          0.04999695 = queryNorm
        0.44868594 = fieldWeight in 6113, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          4.6892867 = idf(docFreq=1104, maxDocs=44218)
          0.0390625 = fieldNorm(doc=6113)
  0.33333334 = coord(1/3)
```
Abstract

Co-occurrence matrices, such as cocitation, coword, and colink matrices, have been used widely in the information sciences. However, confusion and controversy have hindered the proper statistical analysis of these data. The underlying problem, in our opinion, involved understanding the nature of various types of matrices. This article discusses the difference between a symmetrical cocitation matrix and an asymmetrical citation matrix as well as the appropriate statistical techniques that can be applied to each of these matrices, respectively. Similarity measures (such as the Pearson correlation coefficient or the cosine) should not be applied to the symmetrical cocitation matrix but can be applied to the asymmetrical citation matrix to derive the proximity matrix. The argument is illustrated with examples. The study then extends the application of co-occurrence matrices to the Web environment, in which the nature of the available data and thus data collection methods are different from those of traditional databases such as the Science Citation Index. A set of data collected with the Google Scholar search engine is analyzed by using both the traditional methods of multivariate analysis and the new visualization software Pajek, which is based on social network analysis and graph theory.
Thelwall, M.; Vaughan, L.; Björneborn, L.: Webometrics (2004) 0.03
```
0.028630298 = product of:
  0.08589089 = sum of:
    0.08589089 = weight(_text_:citation in 4279) [ClassicSimilarity], result of:
      0.08589089 = score(doc=4279,freq=4.0), product of:
        0.23445003 = queryWeight, product of:
          4.6892867 = idf(docFreq=1104, maxDocs=44218)
          0.04999695 = queryNorm
        0.36635053 = fieldWeight in 4279, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.6892867 = idf(docFreq=1104, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4279)
  0.33333334 = coord(1/3)
```
Abstract

Webometrics, the quantitative study of Web-related phenomena, emerged from the realization that methods originally designed for bibliometric analysis of scientific journal article citation patterns could be applied to the Web, with commercial search engines providing the raw data. Almind and Ingwersen (1997) defined the field and gave it its name. Other pioneers included Rodriguez Gairin (1997) and Aguillo (1998). Larson (1996) undertook exploratory link structure analysis, as did Rousseau (1997). Webometrics encompasses research from fields beyond information science such as communication studies, statistical physics, and computer science. In this review we concentrate on link analysis, but also cover other aspects of webometrics, including Web log fle analysis. One theme that runs through this chapter is the messiness of Web data and the need for data cleansing heuristics. The uncontrolled Web creates numerous problems in the interpretation of results, for instance, from the automatic creation or replication of links. The loose connection between top-level domain specifications (e.g., com, edu, and org) and their actual content is also a frustrating problem. For example, many .com sites contain noncommercial content, although com is ostensibly the main commercial top-level domain. Indeed, a skeptical researcher could claim that obstacles of this kind are so great that all Web analyses lack value. As will be seen, one response to this view, a view shared by critics of evaluative bibliometrics, is to demonstrate that Web data correlate significantly with some non-Web data in order to prove that the Web data are not wholly random. A practical response has been to develop increasingly sophisticated data cleansing techniques and multiple data analysis methods.

Theme

Citation indexing
Vaughan, L.: Visualizing linguistic and cultural differences using Web co-link data (2006) 0.02
```
0.024293613 = product of:
  0.072880834 = sum of:
    0.072880834 = weight(_text_:citation in 184) [ClassicSimilarity], result of:
      0.072880834 = score(doc=184,freq=2.0), product of:
        0.23445003 = queryWeight, product of:
          4.6892867 = idf(docFreq=1104, maxDocs=44218)
          0.04999695 = queryNorm
        0.31085873 = fieldWeight in 184, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.6892867 = idf(docFreq=1104, maxDocs=44218)
          0.046875 = fieldNorm(doc=184)
  0.33333334 = coord(1/3)
```
Abstract

The study examined Web co-links to Canadian university Web sites. Multidimensional scaling (MDS) was used to analyze and visualize co-link data as was done in co-citation analysis. Co-link data were collected in ways that would reflect three different views, the global view, the French Canada view, and the English Canada view. Mapping results of the three data sets accurately reflected the ways Canadians see the universities and clearly showed the linguistic and cultural differences within Canadian society. This shows that Web co-linking is not a random phenomenon and that co-link data contain useful information for Web data mining. It is proposed that the method developed in the study can be applied to other contexts such as analyzing relationships of different organizations or countries. This kind of research is promising because of the dynamics and the diversity of the Web.
Vaughan, L.; Yang, R.: Web data as academic and business quality estimates : a comparison of three data sources (2012) 0.02
```
0.020244677 = product of:
  0.06073403 = sum of:
    0.06073403 = weight(_text_:citation in 452) [ClassicSimilarity], result of:
      0.06073403 = score(doc=452,freq=2.0), product of:
        0.23445003 = queryWeight, product of:
          4.6892867 = idf(docFreq=1104, maxDocs=44218)
          0.04999695 = queryNorm
        0.25904894 = fieldWeight in 452, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.6892867 = idf(docFreq=1104, maxDocs=44218)
          0.0390625 = fieldNorm(doc=452)
  0.33333334 = coord(1/3)
```
Abstract

Earlier studies found that web hyperlink data contain various types of information, ranging from academic to political, that can be used to analyze a variety of social phenomena. Specifically, the numbers of inlinks to academic websites are associated with academic performance, while the counts of inlinks to company websites correlate with business variables. However, the scarcity of sources from which to collect inlink data in recent years has required us to seek new data sources. The recent demise of the inlink search function of Yahoo! made this need more pressing. Different alternative variables or data sources have been proposed. This study compared three types of web data to determine which are better as academic and business quality estimates, and what are the relationships among the three data sources. The study found that Alexa inlink and Google URL citation data can replace Yahoo! inlink data and that the former is better than the latter. Alexa is even better than Yahoo!, which has been the main data source in recent years. The unique nature of Alexa data could explain its relative advantages over other data sources.
Vaughan, L.; Thelwall, M.: Scholarly use of the Web : what are the key inducers of links to journal Web sites? (2003) 0.01
```
0.009338023 = product of:
  0.02801407 = sum of:
    0.02801407 = product of:
      0.05602814 = sum of:
        0.05602814 = weight(_text_:reports in 1236) [ClassicSimilarity], result of:
          0.05602814 = score(doc=1236,freq=2.0), product of:
            0.2251839 = queryWeight, product of:
              4.503953 = idf(docFreq=1329, maxDocs=44218)
              0.04999695 = queryNorm
            0.24881059 = fieldWeight in 1236, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.503953 = idf(docFreq=1329, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1236)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)
```
Abstract

Web links have been studied by information scientists for at least six years but it is only in the past two that clear evidence has emerged to show that counts of links to scholarly Web spaces (universities and departments) can correlate significantly with research measures, giving some credence to their use for the investigation of scholarly communication. This paper reports an a study to investigate the factors that influence the creation of links to journal Web sites. An empirical approach is used: collecting data and testing for significant patterns. The specific questions addressed are whether site age and site content are inducers of links to a journal's Web site as measured by the ratio of link counts to Journal Impact Factors, two variables previously discovered to be related. A new methodology for data collection is also introduced that uses the Internet Archive to obtain an earliest known creation date for Web sites. The results show that both site age and site content are significant factors for the disciplines studied: library and information science, and law. Comparisons between the two fields also show disciplinary differences in Web site characteristics. Scholars and publishers should be particularly aware that richer content an a journal's Web site tends to generate links and thus the traffic to the site.
Vaughan, L.: Uncovering information from social media hyperlinks (2016) 0.01
```
0.009338023 = product of:
  0.02801407 = sum of:
    0.02801407 = product of:
      0.05602814 = sum of:
        0.05602814 = weight(_text_:reports in 2892) [ClassicSimilarity], result of:
          0.05602814 = score(doc=2892,freq=2.0), product of:
            0.2251839 = queryWeight, product of:
              4.503953 = idf(docFreq=1329, maxDocs=44218)
              0.04999695 = queryNorm
            0.24881059 = fieldWeight in 2892, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.503953 = idf(docFreq=1329, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2892)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)
```
Abstract

Analyzing hyperlink patterns has been a major research topic since the early days of the web. Numerous studies reported uncovering rich information and methodological advances. However, very few studies thus far examined hyperlinks in the rapidly developing sphere of social media. This paper reports a study that helps fill this gap. The study analyzed links originating from tweets to the websites of 3 types of organizations (government, education, and business). Data were collected over an 8-month period to observe the fluctuation and reliability of the individual data set. Hyperlink data from the general web (not social media sites) were also collected and compared with social media data. The study found that the 2 types of hyperlink data correlated significantly and that analyzing the 2 together can help organizations see their relative strength or weakness in the two platforms. The study also found that both types of inlink data correlated with offline measures of organizations' performance. Twitter data from a relatively short period were fairly reliable in estimating performance measures. The timelier nature of social media data as well as the date/time stamps on tweets make this type of data potentially more valuable than that from the general web.

Vaughan, L.; Chen, Y.: Data mining from web search queries : a comparison of Google trends and Baidu index (2015) 0.01

0.0056449138 = product of:
  0.016934741 = sum of:
    0.016934741 = product of:
      0.033869483 = sum of:
        0.033869483 = weight(_text_:22 in 1605) [ClassicSimilarity], result of:
          0.033869483 = score(doc=1605,freq=2.0), product of:
            0.1750808 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04999695 = queryNorm
            0.19345059 = fieldWeight in 1605, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1605)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Journal of the Association for Information Science and Technology. 66(2015) no.1, S.13-22

Search (9 results, page 1 of 1)

Authors

Years

Themes