Search (58 results, page 2 of 3)

Kousha, K.; Thelwall, M.: Google book search : citation analysis for social science and the humanities (2009) 0.00
```
0.0032419965 = product of:
  0.029177967 = sum of:
    0.029177967 = weight(_text_:data in 2946) [ClassicSimilarity], result of:
      0.029177967 = score(doc=2946,freq=4.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.24703519 = fieldWeight in 2946, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2946)
  0.11111111 = coord(1/9)
```
Abstract

In both the social sciences and the humanities, books and monographs play significant roles in research communication. The absence of citations from most books and monographs from the Thomson Reuters/Institute for Scientific Information databases (ISI) has been criticized, but attempts to include citations from or to books in the research evaluation of the social sciences and humanities have not led to widespread adoption. This article assesses whether Google Book Search (GBS) can partially fill this gap by comparing citations from books with citations from journal articles to journal articles in 10 science, social science, and humanities disciplines. Book citations were 31% to 212% of ISI citations and, hence, numerous enough to supplement ISI citations in the social sciences and humanities covered, but not in the sciences (3%-5%), except for computing (46%), due to numerous published conference proceedings. A case study was also made of all 1,923 articles in the 51 information science and library science ISI-indexed journals published in 2003. Within this set, highly book-cited articles tended to receive many ISI citations, indicating a significant relationship between the two types of citation data, but with important exceptions that point to the additional information provided by book citations. In summary, GBS is clearly a valuable new source of citation data for the social sciences and humanities. One practical implication is that book-oriented scholars should consult it for additional citations to their work when applying for promotion and tenure.
Thelwall, M.; Buckley, K.; Paltoglou, G.: Sentiment strength detection for the social web (2012) 0.00
```
0.0032419965 = product of:
  0.029177967 = sum of:
    0.029177967 = weight(_text_:data in 4972) [ClassicSimilarity], result of:
      0.029177967 = score(doc=4972,freq=4.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.24703519 = fieldWeight in 4972, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4972)
  0.11111111 = coord(1/9)
```
Abstract

Sentiment analysis is concerned with the automatic extraction of sentiment-related information from text. Although most sentiment analysis addresses commercial tasks, such as extracting opinions from product reviews, there is increasing interest in the affective dimension of the social web, and Twitter in particular. Most sentiment analysis algorithms are not ideally suited to this task because they exploit indirect indicators of sentiment that can reflect genre or topic instead. Hence, such algorithms used to process social web texts can identify spurious sentiment patterns caused by topics rather than affective phenomena. This article assesses an improved version of the algorithm SentiStrength for sentiment strength detection across the social web that primarily uses direct indications of sentiment. The results from six diverse social web data sets (MySpace, Twitter, YouTube, Digg, Runners World, BBC Forums) indicate that SentiStrength 2 is successful in the sense of performing better than a baseline approach for all data sets in both supervised and unsupervised cases. SentiStrength is not always better than machine-learning approaches that exploit indirect indicators of sentiment, however, and is particularly weaker for positive sentiment in news-related discussions. Overall, the results suggest that, even unsupervised, SentiStrength is robust enough to be applied to a wide variety of different social web contexts.
Haustein, S.; Peters, I.; Sugimoto, C.R.; Thelwall, M.; Larivière, V.: Tweeting biomedicine : an analysis of tweets and citations in the biomedical literature (2014) 0.00
```
0.0032419965 = product of:
  0.029177967 = sum of:
    0.029177967 = weight(_text_:data in 1229) [ClassicSimilarity], result of:
      0.029177967 = score(doc=1229,freq=4.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.24703519 = fieldWeight in 1229, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1229)
  0.11111111 = coord(1/9)
```
Abstract

Data collected by social media platforms have been introduced as new sources for indicators to help measure the impact of scholarly research in ways that are complementary to traditional citation analysis. Data generated from social media activities can be used to reflect broad types of impact. This article aims to provide systematic evidence about how often Twitter is used to disseminate information about journal articles in the biomedical sciences. The analysis is based on 1.4 million documents covered by both PubMed and Web of Science and published between 2010 and 2012. The number of tweets containing links to these documents was analyzed and compared to citations to evaluate the degree to which certain journals, disciplines, and specialties were represented on Twitter and how far tweets correlate with citation impact. With less than 10% of PubMed articles mentioned on Twitter, its uptake is low in general but differs between journals and specialties. Correlations between tweets and citations are low, implying that impact metrics based on tweets are different from those based on citations. A framework using the coverage of articles and the correlation between Twitter mentions and citations is proposed to facilitate the evaluation of novel social-media-based metrics.
Mohammadi, E.; Thelwall, M.; Haustein, S.; Larivière, V.: Who reads research articles? : an altmetrics analysis of Mendeley user categories (2015) 0.00
```
0.0032419965 = product of:
  0.029177967 = sum of:
    0.029177967 = weight(_text_:data in 2162) [ClassicSimilarity], result of:
      0.029177967 = score(doc=2162,freq=4.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.24703519 = fieldWeight in 2162, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2162)
  0.11111111 = coord(1/9)
```
Abstract

Little detailed information is known about who reads research articles and the contexts in which research articles are read. Using data about people who register in Mendeley as readers of articles, this article explores different types of users of Clinical Medicine, Engineering and Technology, Social Science, Physics, and Chemistry articles inside and outside academia. The majority of readers for all disciplines were PhD students, postgraduates, and postdocs but other types of academics were also represented. In addition, many Clinical Medicine articles were read by medical professionals. The highest correlations between citations and Mendeley readership counts were found for types of users who often authored academic articles, except for associate professors in some sub-disciplines. This suggests that Mendeley readership can reflect usage similar to traditional citation impact if the data are restricted to readers who are also authors without the delay of impact measured by citation counts. At the same time, Mendeley statistics can also reveal the hidden impact of some research articles, such as educational value for nonauthor users inside academia or the impact of research articles on practice for readers outside academia.
Thelwall, M.: Text characteristics of English language university Web sites (2005) 0.00
```
0.002750925 = product of:
  0.024758326 = sum of:
    0.024758326 = weight(_text_:data in 3463) [ClassicSimilarity], result of:
      0.024758326 = score(doc=3463,freq=2.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.2096163 = fieldWeight in 3463, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.046875 = fieldNorm(doc=3463)
  0.11111111 = coord(1/9)
```
Abstract

The nature of the contents of academic Web sites is of direct relevance to the new field of scientific Web intelligence, and for search engine and topic-specific crawler designers. We analyze word frequencies in national academic Webs using the Web sites of three Englishspeaking nations: Australia, New Zealand, and the United Kingdom. Strong regularities were found in page size and word frequency distributions, but with significant anomalies. At least 26% of pages contain no words. High frequency words include university names and acronyms, Internet terminology, and computing product names: not always words in common usage away from the Web. A minority of low frequency words are spelling mistakes, with other common types including nonwords, proper names, foreign language terms or computer science variable names. Based upon these findings, recommendations for data cleansing and filtering are made, particularly for clustering applications.
Angus, E.; Thelwall, M.; Stuart, D.: General patterns of tag usage among university groups in Flickr (2008) 0.00
```
0.002750925 = product of:
  0.024758326 = sum of:
    0.024758326 = weight(_text_:data in 2554) [ClassicSimilarity], result of:
      0.024758326 = score(doc=2554,freq=2.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.2096163 = fieldWeight in 2554, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.046875 = fieldNorm(doc=2554)
  0.11111111 = coord(1/9)
```
Abstract

Purpose - The purpose of this research is to investigate general patterns of tag usage and determines the usefulness of the tags used within university image groups to the wider Flickr community. There has been a significant rise in the use of Web 2.0 social network web sites and online applications in recent years. One of the most popular is Flickr, an online image management application. Design/methodology/approach - This study uses a webometric data collection, classification and informetric analysis. Findings - The results show that members of university image groups tend to tag in a manner that is of use to users of the system as a whole rather than merely for the tag creator. Originality/value - This paper gives a valuable insight into the tagging practices of image groups in Flickr.

Thelwall, M.; Wilkinson, D.: Public dialogs in social network sites : What is their purpose? (2010) 0.00

0.002750925 = product of:
  0.024758326 = sum of:
    0.024758326 = weight(_text_:data in 3327) [ClassicSimilarity], result of:
      0.024758326 = score(doc=3327,freq=2.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.2096163 = fieldWeight in 3327, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.046875 = fieldNorm(doc=3327)
  0.11111111 = coord(1/9)

Theme: Data Mining

Thelwall, M.: Webometrics (2009) 0.00
```
0.002750925 = product of:
  0.024758326 = sum of:
    0.024758326 = weight(_text_:data in 3906) [ClassicSimilarity], result of:
      0.024758326 = score(doc=3906,freq=2.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.2096163 = fieldWeight in 3906, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.046875 = fieldNorm(doc=3906)
  0.11111111 = coord(1/9)
```
Abstract

Webometrics is an information science field concerned with measuring aspects of the World Wide Web (WWW) for a variety of information science research goals. It came into existence about five years after the Web was formed and has since grown to become a significant aspect of information science, at least in terms of published research. Although some webometrics research has focused on the structure or evolution of the Web itself or the performance of commercial search engines, most has used data from the Web to shed light on information provision or online communication in various contexts. Most prominently, techniques have been developed to track, map, and assess Web-based informal scholarly communication, for example, in terms of the hyperlinks between academic Web sites or the online impact of digital repositories. In addition, a range of nonacademic issues and groups of Web users have also been analyzed.
Didegah, F.; Thelwall, M.: Determinants of research citation impact in nanoscience and nanotechnology (2013) 0.00
```
0.002750925 = product of:
  0.024758326 = sum of:
    0.024758326 = weight(_text_:data in 737) [ClassicSimilarity], result of:
      0.024758326 = score(doc=737,freq=2.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.2096163 = fieldWeight in 737, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.046875 = fieldNorm(doc=737)
  0.11111111 = coord(1/9)
```
Abstract

This study investigates a range of metrics available when a nanoscience and nanotechnology article is published to see which metrics correlate more with the number of citations to the article. It also introduces the degree of internationality of journals and references as new metrics for this purpose. The journal impact factor; the impact of references; the internationality of authors, journals, and references; and the number of authors, institutions, and references were all calculated for papers published in nanoscience and nanotechnology journals in the Web of Science from 2007 to 2009. Using a zero-inflated negative binomial regression model on the data set, the impact factor of the publishing journal and the citation impact of the cited references were found to be the most effective determinants of citation counts in all four time periods. In the entire 2007 to 2009 period, apart from journal internationality and author numbers and internationality, all other predictor variables had significant effects on citation counts.
Shema, H.; Bar-Ilan, J.; Thelwall, M.: Do blog citations correlate with a higher number of future citations? : Research blogs as a potential source for alternative metrics (2014) 0.00
```
0.002750925 = product of:
  0.024758326 = sum of:
    0.024758326 = weight(_text_:data in 1258) [ClassicSimilarity], result of:
      0.024758326 = score(doc=1258,freq=2.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.2096163 = fieldWeight in 1258, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.046875 = fieldNorm(doc=1258)
  0.11111111 = coord(1/9)
```
Abstract

Journal-based citations are an important source of data for impact indices. However, the impact of journal articles extends beyond formal scholarly discourse. Measuring online scholarly impact calls for new indices, complementary to the older ones. This article examines a possible alternative metric source, blog posts aggregated at ResearchBlogging.org, which discuss peer-reviewed articles and provide full bibliographic references. Articles reviewed in these blogs therefore receive "blog citations." We hypothesized that articles receiving blog citations close to their publication time receive more journal citations later than the articles in the same journal published in the same year that did not receive such blog citations. Statistically significant evidence for articles published in 2009 and 2010 support this hypothesis for seven of 12 journals (58%) in 2009 and 13 of 19 journals (68%) in 2010. We suggest, based on these results, that blog citations can be used as an alternative metric source.
Thelwall, M.; Kousha, K.: ResearchGate: Disseminating, communicating, and measuring scholarship? (2015) 0.00
```
0.002750925 = product of:
  0.024758326 = sum of:
    0.024758326 = weight(_text_:data in 1813) [ClassicSimilarity], result of:
      0.024758326 = score(doc=1813,freq=2.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.2096163 = fieldWeight in 1813, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.046875 = fieldNorm(doc=1813)
  0.11111111 = coord(1/9)
```
Abstract

ResearchGate is a social network site for academics to create their own profiles, list their publications, and interact with each other. Like Academia.edu, it provides a new way for scholars to disseminate their work and hence potentially changes the dynamics of informal scholarly communication. This article assesses whether ResearchGate usage and publication data broadly reflect existing academic hierarchies and whether individual countries are set to benefit or lose out from the site. The results show that rankings based on ResearchGate statistics correlate moderately well with other rankings of academic institutions, suggesting that ResearchGate use broadly reflects the traditional distribution of academic capital. Moreover, while Brazil, India, and some other countries seem to be disproportionately taking advantage of ResearchGate, academics in China, South Korea, and Russia may be missing opportunities to use ResearchGate to maximize the academic impact of their publications.
Maflahi, N.; Thelwall, M.: When are readership counts as useful as citation counts? : Scopus versus Mendeley for LIS journals (2016) 0.00
```
0.002750925 = product of:
  0.024758326 = sum of:
    0.024758326 = weight(_text_:data in 2495) [ClassicSimilarity], result of:
      0.024758326 = score(doc=2495,freq=2.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.2096163 = fieldWeight in 2495, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.046875 = fieldNorm(doc=2495)
  0.11111111 = coord(1/9)
```
Abstract

In theory, articles can attract readers on the social reference sharing site Mendeley before they can attract citations, so Mendeley altmetrics could provide early indications of article impact. This article investigates the influence of time on the number of Mendeley readers of an article through a theoretical discussion and an investigation into the relationship between counts of readers of, and citations to, 4 general library and information science (LIS) journals. For this discipline, it takes about 7 years for articles to attract as many Scopus citations as Mendeley readers, and after this the Spearman correlation between readers and citers is stable at about 0.6 for all years. This suggests that Mendeley readership counts may be useful impact indicators for both newer and older articles. The lack of dates for individual Mendeley article readers and an unknown bias toward more recent articles mean that readership data should be normalized individually by year, however, before making any comparisons between articles published in different years.
Thelwall, M.: Book genre and author gender : romance > paranormal-romance to autobiography > memoir (2017) 0.00
```
0.002750925 = product of:
  0.024758326 = sum of:
    0.024758326 = weight(_text_:data in 3598) [ClassicSimilarity], result of:
      0.024758326 = score(doc=3598,freq=2.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.2096163 = fieldWeight in 3598, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.046875 = fieldNorm(doc=3598)
  0.11111111 = coord(1/9)
```
Abstract

Although gender differences are known to exist in the publishing industry and in reader preferences, there is little public systematic data about them. This article uses evidence from the book-based social website Goodreads to provide a large scale analysis of 50 major English book genres based on author genders. The results show gender differences in authorship in almost all categories and gender differences the level of interest in, and ratings of, books in a minority of categories. Perhaps surprisingly in this context, there is not a clear gender-based relationship between the success of an author and their prevalence within a genre. The unexpected almost universal authorship gender differences should give new impetus to investigations of the importance of gender in fiction and the success of minority genders in some genres should encourage publishers and librarians to take their work seriously, except perhaps for most male-authored chick-lit.
Zuccala, A.; Thelwall, M.; Oppenheim, C.; Dhiensa, R.: Web intelligence analyses of digital libraries : a case study of the National electronic Library for Health (NeLH) (2007) 0.00
```
0.0025935972 = product of:
  0.023342375 = sum of:
    0.023342375 = weight(_text_:data in 838) [ClassicSimilarity], result of:
      0.023342375 = score(doc=838,freq=4.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.19762816 = fieldWeight in 838, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.03125 = fieldNorm(doc=838)
  0.11111111 = coord(1/9)
```
Abstract

Purpose - The purpose of this paper is to explore the use of LexiURL as a Web intelligence tool for collecting and analysing links to digital libraries, focusing specifically on the National electronic Library for Health (NeLH). Design/methodology/approach - The Web intelligence techniques in this study are a combination of link analysis (web structure mining), web server log file analysis (web usage mining), and text analysis (web content mining), utilizing the power of commercial search engines and drawing upon the information science fields of bibliometrics and webometrics. LexiURL is a computer program designed to calculate summary statistics for lists of links or URLs. Its output is a series of standard reports, for example listing and counting all of the different domain names in the data. Findings - Link data, when analysed together with user transaction log files (i.e. Web referring domains) can provide insights into who is using a digital library and when, and who could be using the digital library if they are "surfing" a particular part of the Web; in this case any site that is linked to or colinked with the NeLH. This study found that the NeLH was embedded in a multifaceted Web context, including many governmental, educational, commercial and organisational sites, with the most interesting being sites from the.edu domain, representing American Universities. Not many links directed to the NeLH were followed on September 25, 2005 (the date of the log file analysis and link extraction analysis), which means that users who access the digital library have been arriving at the site via only a few select links, bookmarks and search engine searches, or non-electronic sources. Originality/value - A number of studies concerning digital library users have been carried out using log file analysis as a research tool. Log files focus on real-time user transactions; while LexiURL can be used to extract links and colinks associated with a digital library's growing Web network. This Web network is not recognized often enough, and can be a useful indication of where potential users are surfing, even if they have not yet specifically visited the NeLH site.
Thelwall, M.: Extracting macroscopic information from Web links (2001) 0.00
```
0.0022924377 = product of:
  0.02063194 = sum of:
    0.02063194 = weight(_text_:data in 6851) [ClassicSimilarity], result of:
      0.02063194 = score(doc=6851,freq=2.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.17468026 = fieldWeight in 6851, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=6851)
  0.11111111 = coord(1/9)
```
Abstract

Much has been written about the potential and pitfalls of macroscopic Web-based link analysis, yet there have been no studies that have provided clear statistical evidence that any of the proposed calculations can produce results over large areas of the Web that correlate with phenomena external to the Internet. This article attempts to provide such evidence through an evaluation of Ingwersen's (1998) proposed external Web Impact Factor (WIF) for the original use of the Web: the interlinking of academic research. In particular, it studies the case of the relationship between academic hyperlinks and research activity for universities in Britain, a country chosen for its variety of institutions and the existence of an official government rating exercise for research. After reviewing the numerous reasons why link counts may be unreliable, it demonstrates that four different WIFs do, in fact, correlate with the conventional academic research measures. The WIF delivering the greatest correlation with research rankings was the ratio of Web pages with links pointing at research-based pages to faculty numbers. The scarcity of links to electronic academic papers in the data set suggests that, in contrast to citation analysis, this WIF is measuring the reputations of universities and their scholars, rather than the quality of their publications
Thelwall, M.: Conceptualizing documentation on the Web : an evaluation of different heuristic-based models for counting links between university Web sites (2002) 0.00
```
0.0022924377 = product of:
  0.02063194 = sum of:
    0.02063194 = weight(_text_:data in 978) [ClassicSimilarity], result of:
      0.02063194 = score(doc=978,freq=2.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.17468026 = fieldWeight in 978, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=978)
  0.11111111 = coord(1/9)
```
Abstract

All known previous Web link studies have used the Web page as the primary indivisible source document for counting purposes. Arguments are presented to explain why this is not necessarily optimal and why other alternatives have the potential to produce better results. This is despite the fact that individual Web files are often the only choice if search engines are used for raw data and are the easiest basic Web unit to identify. The central issue is of defining the Web "document": that which should comprise the single indissoluble unit of coherent material. Three alternative heuristics are defined for the educational arena based upon the directory, the domain and the whole university site. These are then compared by implementing them an a set of 108 UK university institutional Web sites under the assumption that a more effective heuristic will tend to produce results that correlate more highly with institutional research productivity. It was discovered that the domain and directory models were able to successfully reduce the impact of anomalous linking behavior between pairs of Web sites, with the latter being the method of choice. Reasons are then given as to why a document model an its own cannot eliminate all anomalies in Web linking behavior. Finally, the results from all models give a clear confirmation of the very strong association between the research productivity of a UK university and the number of incoming links from its peers' Web sites.
Kousha, K.; Thelwall, M.: Google Scholar citations and Google Web/URL citations : a multi-discipline exploratory analysis (2007) 0.00
```
0.0022924377 = product of:
  0.02063194 = sum of:
    0.02063194 = weight(_text_:data in 337) [ClassicSimilarity], result of:
      0.02063194 = score(doc=337,freq=2.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.17468026 = fieldWeight in 337, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=337)
  0.11111111 = coord(1/9)
```
Abstract

We use a new data gathering method, "Web/URL citation," Web/URL and Google Scholar to compare traditional and Web-based citation patterns across multiple disciplines (biology, chemistry, physics, computing, sociology, economics, psychology, and education) based upon a sample of 1,650 articles from 108 open access (OA) journals published in 2001. A Web/URL citation of an online journal article is a Web mention of its title, URL, or both. For each discipline, except psychology, we found significant correlations between Thomson Scientific (formerly Thomson ISI, here: ISI) citations and both Google Scholar and Google Web/URL citations. Google Scholar citations correlated more highly with ISI citations than did Google Web/URL citations, indicating that the Web/URL method measures a broader type of citation phenomenon. Google Scholar citations were more numerous than ISI citations in computer science and the four social science disciplines, suggesting that Google Scholar is more comprehensive for social sciences and perhaps also when conference articles are valued and published online. We also found large disciplinary differences in the percentage overlap between ISI and Google Scholar citation sources. Finally, although we found many significant trends, there were also numerous exceptions, suggesting that replacing traditional citation sources with the Web or Google Scholar for research impact calculations would be problematic.
Thelwall, M.: Quantitative comparisons of search engine results (2008) 0.00
```
0.0022924377 = product of:
  0.02063194 = sum of:
    0.02063194 = weight(_text_:data in 2350) [ClassicSimilarity], result of:
      0.02063194 = score(doc=2350,freq=2.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.17468026 = fieldWeight in 2350, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2350)
  0.11111111 = coord(1/9)
```
Abstract

Search engines are normally used to find information or Web sites, but Webometric investigations use them for quantitative data such as the number of pages matching a query and the international spread of those pages. For this type of application, the accuracy of the hit count estimates and range of URLs in the full results are important. Here, we compare the applications programming interfaces of Google, Yahoo!, and Live Search for 1,587 single word searches. The hit count estimates were broadly consistent but with Yahoo! and Google, reporting 5-6 times more hits than Live Search. Yahoo! tended to return slightly more matching URLs than Google, with Live Search returning significantly fewer. Yahoo!'s result URLs included a significantly wider range of domains and sites than the other two, and there was little consistency between the three engines in the number of different domains. In contrast, the three engines were reasonably consistent in the number of different top-level domains represented in the result URLs, although Yahoo! tended to return the most. In conclusion, quantitative results from the three search engines are mostly consistent but with unexpected types of inconsistency that users should be aware of. Google is recommended for hit count estimates but Yahoo! is recommended for all other Webometric purposes.
Kousha, K.; Thelwall, M.: Assessing the impact of disciplinary research on teaching : an automatic analysis of online syllabuses (2008) 0.00
```
0.0022924377 = product of:
  0.02063194 = sum of:
    0.02063194 = weight(_text_:data in 2383) [ClassicSimilarity], result of:
      0.02063194 = score(doc=2383,freq=2.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.17468026 = fieldWeight in 2383, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2383)
  0.11111111 = coord(1/9)
```
Abstract

The impact of published academic research in the sciences and social sciences, when measured, is commonly estimated by counting citations from journal articles. The Web has now introduced new potential sources of quantitative data online that could be used to measure aspects of research impact. In this article we assess the extent to which citations from online syllabuses could be a valuable source of evidence about the educational utility of research. An analysis of online syllabus citations to 70,700 articles published in 2003 in the journals of 12 subjects indicates that online syllabus citations were sufficiently numerous to be a useful impact indictor in some social sciences, including political science and information and library science, but not in others, nor in any sciences. This result was consistent with current social science research having, in general, more educational value than current science research. Moreover, articles frequently cited in online syllabuses were not necessarily highly cited by other articles. Hence it seems that online syllabus citations provide a valuable additional source of evidence about the impact of journals, scholars, and research articles in some social sciences.
Thelwall, M.: ¬A comparison of link and URL citation counting (2011) 0.00
```
0.0022924377 = product of:
  0.02063194 = sum of:
    0.02063194 = weight(_text_:data in 4533) [ClassicSimilarity], result of:
      0.02063194 = score(doc=4533,freq=2.0), product of:
        0.118112594 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.037353165 = queryNorm
        0.17468026 = fieldWeight in 4533, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4533)
  0.11111111 = coord(1/9)
```
Abstract

Purpose - Link analysis is an established topic within webometrics. It normally uses counts of links between sets of web sites or to sets of web sites. These link counts are derived from web crawlers or commercial search engines with the latter being the only alternative for some investigations. This paper compares link counts with URL citation counts in order to assess whether the latter could be a replacement for the former if the major search engines withdraw their advanced hyperlink search facilities. Design/methodology/approach - URL citation counts are compared with link counts for a variety of data sets used in previous webometric studies. Findings - The results show a high degree of correlation between the two but with URL citations being much less numerous, at least outside academia and business. Research limitations/implications - The results cover a small selection of 15 case studies and so the findings are only indicative. Significant differences between results indicate that the difference between link counts and URL citation counts will vary between webometric studies. Practical implications - Should link searches be withdrawn, then link analyses of less well linked non-academic, non-commercial sites would be seriously weakened, although citations based on e-mail addresses could help to make citations more numerous than links for some business and academic contexts. Originality/value - This is the first systematic study of the difference between link counts and URL citation counts in a variety of contexts and it shows that there are significant differences between the two.

Search (58 results, page 2 of 3)

Authors

Years

Languages

Themes