Search (11 results, page 1 of 1)

Thelwall, M.; Wilkinson, D.: Finding similar academic Web sites with links, bibliometric couplings and colinks (2004) 0.04
```
0.036013078 = product of:
  0.14405231 = sum of:
    0.14405231 = weight(_text_:assess in 2571) [ClassicSimilarity], result of:
      0.14405231 = score(doc=2571,freq=2.0), product of:
        0.36863554 = queryWeight, product of:
          5.8947687 = idf(docFreq=330, maxDocs=44218)
          0.062536046 = queryNorm
        0.39077166 = fieldWeight in 2571, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.8947687 = idf(docFreq=330, maxDocs=44218)
          0.046875 = fieldNorm(doc=2571)
  0.25 = coord(1/4)
```
Abstract

A common task in both Webmetrics and Web information retrieval is to identify a set of Web pages or sites that are similar in content. In this paper we assess the extent to which links, colinks and couplings can be used to identify similar Web sites. As an experiment, a random sample of 500 pairs of domains from the UK academic Web were taken and human assessments of site similarity, based upon content type, were compared against ratings for the three concepts. The results show that using a combination of all three gives the highest probability of identifying similar sites, but surprisingly this was only a marginal improvement over using links alone. Another unexpected result was that high values for either colink counts or couplings were associated with only a small increased likelihood of similarity. The principal advantage of using couplings and colinks was found to be greater coverage in terms of a much larger number of pairs of sites being connected by these measures, instead of increased probability of similarity. In information retrieval terminology, this is improved recall rather than improved precision.
Thelwall, M.: Extracting accurate and complete results from search engines : case study windows live (2008) 0.04
```
0.036013078 = product of:
  0.14405231 = sum of:
    0.14405231 = weight(_text_:assess in 1338) [ClassicSimilarity], result of:
      0.14405231 = score(doc=1338,freq=2.0), product of:
        0.36863554 = queryWeight, product of:
          5.8947687 = idf(docFreq=330, maxDocs=44218)
          0.062536046 = queryNorm
        0.39077166 = fieldWeight in 1338, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.8947687 = idf(docFreq=330, maxDocs=44218)
          0.046875 = fieldNorm(doc=1338)
  0.25 = coord(1/4)
```
Abstract

Although designed for general Web searching, Webometrics and related research commercial search engines are also used to produce estimated hit counts or lists of URLs matching a query. Unfortunately, however, they do not return all matching URLs for a search and their hit count estimates are unreliable. In this article, we assess whether it is possible to obtain complete lists of matching URLs from Windows Live, and whether any of its hit count estimates are robust. As part of this, we introduce two new methods to extract extra URLs from search engines: automated query splitting and automated domain and TLD searching. Both methods successfully identify additional matching URLs but the findings suggest that there is no way to get complete lists of matching URLs or accurate hit counts from Windows Live, although some estimating suggestions are provided.
Shifman, L.; Thelwall, M.: Assessing global diffusion with Web memetics : the spread and evolution of a popular joke (2009) 0.04
```
0.036013078 = product of:
  0.14405231 = sum of:
    0.14405231 = weight(_text_:assess in 3303) [ClassicSimilarity], result of:
      0.14405231 = score(doc=3303,freq=2.0), product of:
        0.36863554 = queryWeight, product of:
          5.8947687 = idf(docFreq=330, maxDocs=44218)
          0.062536046 = queryNorm
        0.39077166 = fieldWeight in 3303, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.8947687 = idf(docFreq=330, maxDocs=44218)
          0.046875 = fieldNorm(doc=3303)
  0.25 = coord(1/4)
```
Abstract

Memes are small units of culture, analogous to genes, which flow from person to person by copying or imitation. More than any previous medium, the Internet has the technical capabilities for global meme diffusion. Yet, to spread globally, memes need to negotiate their way through cultural and linguistic borders. This article introduces a new broad method, Web memetics, comprising extensive Web searches and combined quantitative and qualitative analyses, to identify and assess: (a) the different versions of a meme, (b) its evolution online, and (c) its Web presence and translation into common Internet languages. This method is demonstrated through one extensively circulated joke about men, women, and computers. The results show that the joke has mutated into several different versions and is widely translated, and that translations incorporate small, local adaptations while retaining the English versions' fundamental components. In conclusion, Web memetics has demonstrated its ability to identify and track the evolution and spread of memes online, with interesting results, albeit for only one case study.
Thelwall, M.: Webometrics (2009) 0.04
```
0.036013078 = product of:
  0.14405231 = sum of:
    0.14405231 = weight(_text_:assess in 3906) [ClassicSimilarity], result of:
      0.14405231 = score(doc=3906,freq=2.0), product of:
        0.36863554 = queryWeight, product of:
          5.8947687 = idf(docFreq=330, maxDocs=44218)
          0.062536046 = queryNorm
        0.39077166 = fieldWeight in 3906, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.8947687 = idf(docFreq=330, maxDocs=44218)
          0.046875 = fieldNorm(doc=3906)
  0.25 = coord(1/4)
```
Abstract

Webometrics is an information science field concerned with measuring aspects of the World Wide Web (WWW) for a variety of information science research goals. It came into existence about five years after the Web was formed and has since grown to become a significant aspect of information science, at least in terms of published research. Although some webometrics research has focused on the structure or evolution of the Web itself or the performance of commercial search engines, most has used data from the Web to shed light on information provision or online communication in various contexts. Most prominently, techniques have been developed to track, map, and assess Web-based informal scholarly communication, for example, in terms of the hyperlinks between academic Web sites or the online impact of digital repositories. In addition, a range of nonacademic issues and groups of Web users have also been analyzed.
Thelwall, M.; Harries, G.: Do the Web Sites of Higher Rated Scholars Have Significantly More Online Impact? (2004) 0.03
```
0.030010901 = product of:
  0.120043606 = sum of:
    0.120043606 = weight(_text_:assess in 2123) [ClassicSimilarity], result of:
      0.120043606 = score(doc=2123,freq=2.0), product of:
        0.36863554 = queryWeight, product of:
          5.8947687 = idf(docFreq=330, maxDocs=44218)
          0.062536046 = queryNorm
        0.32564306 = fieldWeight in 2123, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.8947687 = idf(docFreq=330, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2123)
  0.25 = coord(1/4)
```
Abstract

The quality and impact of academic Web sites is of interest to many audiences, including the scholars who use them and Web educators who need to identify best practice. Several large-scale European Union research projects have been funded to build new indicators for online scientific activity, reflecting recognition of the importance of the Web for scholarly communication. In this paper we address the key question of whether higher rated scholars produce higher impact Web sites, using the United Kingdom as a case study and measuring scholars' quality in terms of university-wide average research ratings. Methodological issues concerning the measurement of the online impact are discussed, leading to the adoption of counts of links to a university's constituent single domain Web sites from an aggregated counting metric. The findings suggest that universities with higher rated scholars produce significantly more Web content but with a similar average online impact. Higher rated scholars therefore attract more total links from their peers, but only by being more prolific, refuting earlier suggestions. It can be surmised that general Web publications are very different from scholarly journal articles and conference papers, for which scholarly quality does associate with citation impact. This has important implications for the construction of new Web indicators, for example that online impact should not be used to assess the quality of small groups of scholars, even within a single discipline.
Thelwall, M.; Prabowo, R.; Fairclough, R.: Are raw RSS feeds suitable for broad issue scanning? : a science concern case study (2006) 0.03
```
0.030010901 = product of:
  0.120043606 = sum of:
    0.120043606 = weight(_text_:assess in 6116) [ClassicSimilarity], result of:
      0.120043606 = score(doc=6116,freq=2.0), product of:
        0.36863554 = queryWeight, product of:
          5.8947687 = idf(docFreq=330, maxDocs=44218)
          0.062536046 = queryNorm
        0.32564306 = fieldWeight in 6116, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.8947687 = idf(docFreq=330, maxDocs=44218)
          0.0390625 = fieldNorm(doc=6116)
  0.25 = coord(1/4)
```
Abstract

Broad issue scanning is the task of identifying important public debates arising in a given broad issue; really simple syndication (RSS) feeds are a natural information source for investigating broad issues. RSS, as originally conceived, is a method for publishing timely and concise information on the Internet, for example, about the main stories in a news site or the latest postings in a blog. RSS feeds are potentially a nonintrusive source of high-quality data about public opinion: Monitoring a large number may allow quantitative methods to extract information relevant to a given need. In this article we describe an RSS feed-based coword frequency method to identify bursts of discussion relevant to a given broad issue. A case study of public science concerns is used to demonstrate the method and assess the suitability of raw RSS feeds for broad issue scanning (i.e., without data cleansing). An attempt to identify genuine science concern debates from the corpus through investigating the top 1,000 "burst" words found only two genuine debates, however. The low success rate was mainly caused by a few pathological feeds that dominated the results and obscured any significant debates. The results point to the need to develop effective data cleansing procedures for RSS feeds, particularly if there is not a large quantity of discussion about the broad issue, and a range of potential techniques is suggested. Finally, the analysis confirmed that the time series information generated by real-time monitoring of RSS feeds could usefully illustrate the evolution of new debates relevant to a broad issue.
Thelwall, M.; Li, X.; Barjak, F.; Robinson, S.: Assessing the international web connectivity of research groups (2008) 0.03
```
0.030010901 = product of:
  0.120043606 = sum of:
    0.120043606 = weight(_text_:assess in 1401) [ClassicSimilarity], result of:
      0.120043606 = score(doc=1401,freq=2.0), product of:
        0.36863554 = queryWeight, product of:
          5.8947687 = idf(docFreq=330, maxDocs=44218)
          0.062536046 = queryNorm
        0.32564306 = fieldWeight in 1401, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.8947687 = idf(docFreq=330, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1401)
  0.25 = coord(1/4)
```
Abstract

Purpose - The purpose of this paper is to claim that it is useful to assess the web connectivity of research groups, describe hyperlink-based techniques to achieve this and present brief details of European life sciences research groups as a case study. Design/methodology/approach - A commercial search engine was harnessed to deliver hyperlink data via its automatic query submission interface. A special purpose link analysis tool, LexiURL, then summarised and graphed the link data in appropriate ways. Findings - Webometrics can provide a wide range of descriptive information about the international connectivity of research groups. Research limitations/implications - Only one field was analysed, data was taken from only one search engine, and the results were not validated. Practical implications - Web connectivity seems to be particularly important for attracting overseas job applicants and to promote research achievements and capabilities, and hence we contend that it can be useful for national and international governments to use webometrics to ensure that the web is being used effectively by research groups. Originality/value - This is the first paper to make a case for the value of using a range of webometric techniques to evaluate the web presences of research groups within a field, and possibly the first "applied" webometrics study produced for an external contract.
Kousha, K.; Thelwall, M.: Assessing the impact of disciplinary research on teaching : an automatic analysis of online syllabuses (2008) 0.03
```
0.030010901 = product of:
  0.120043606 = sum of:
    0.120043606 = weight(_text_:assess in 2383) [ClassicSimilarity], result of:
      0.120043606 = score(doc=2383,freq=2.0), product of:
        0.36863554 = queryWeight, product of:
          5.8947687 = idf(docFreq=330, maxDocs=44218)
          0.062536046 = queryNorm
        0.32564306 = fieldWeight in 2383, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.8947687 = idf(docFreq=330, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2383)
  0.25 = coord(1/4)
```
Abstract

The impact of published academic research in the sciences and social sciences, when measured, is commonly estimated by counting citations from journal articles. The Web has now introduced new potential sources of quantitative data online that could be used to measure aspects of research impact. In this article we assess the extent to which citations from online syllabuses could be a valuable source of evidence about the educational utility of research. An analysis of online syllabus citations to 70,700 articles published in 2003 in the journals of 12 subjects indicates that online syllabus citations were sufficiently numerous to be a useful impact indictor in some social sciences, including political science and information and library science, but not in others, nor in any sciences. This result was consistent with current social science research having, in general, more educational value than current science research. Moreover, articles frequently cited in online syllabuses were not necessarily highly cited by other articles. Hence it seems that online syllabus citations provide a valuable additional source of evidence about the impact of journals, scholars, and research articles in some social sciences.

Thelwall, M.; Ruschenburg, T.: Grundlagen und Forschungsfelder der Webometrie (2006) 0.02

0.016945543 = product of:
  0.06778217 = sum of:
    0.06778217 = weight(_text_:22 in 77) [ClassicSimilarity], result of:
      0.06778217 = score(doc=77,freq=2.0), product of:
        0.21899058 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.062536046 = queryNorm
        0.30952093 = fieldWeight in 77, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.0625 = fieldNorm(doc=77)
  0.25 = coord(1/4)

Date: 4.12.2006 12:12:22

Levitt, J.M.; Thelwall, M.: Citation levels and collaboration within library and information science (2009) 0.01
```
0.0149778845 = product of:
  0.059911538 = sum of:
    0.059911538 = weight(_text_:22 in 2734) [ClassicSimilarity], result of:
      0.059911538 = score(doc=2734,freq=4.0), product of:
        0.21899058 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.062536046 = queryNorm
        0.27358043 = fieldWeight in 2734, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2734)
  0.25 = coord(1/4)
```
Abstract

Collaboration is a major research policy objective, but does it deliver higher quality research? This study uses citation analysis to examine the Web of Science (WoS) Information Science & Library Science subject category (IS&LS) to ascertain whether, in general, more highly cited articles are more highly collaborative than other articles. It consists of two investigations. The first investigation is a longitudinal comparison of the degree and proportion of collaboration in five strata of citation; it found that collaboration in the highest four citation strata (all in the most highly cited 22%) increased in unison over time, whereas collaboration in the lowest citation strata (un-cited articles) remained low and stable. Given that over 40% of the articles were un-cited, it seems important to take into account the differences found between un-cited articles and relatively highly cited articles when investigating collaboration in IS&LS. The second investigation compares collaboration for 35 influential information scientists; it found that their more highly cited articles on average were not more highly collaborative than their less highly cited articles. In summary, although collaborative research is conducive to high citation in general, collaboration has apparently not tended to be essential to the success of current and former elite information scientists.

Date

22. 3.2009 12:43:51
Kousha, K.; Thelwall, M.: How is science cited on the Web? : a classification of google unique Web citations (2007) 0.01
```
0.010590964 = product of:
  0.042363856 = sum of:
    0.042363856 = weight(_text_:22 in 586) [ClassicSimilarity], result of:
      0.042363856 = score(doc=586,freq=2.0), product of:
        0.21899058 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.062536046 = queryNorm
        0.19345059 = fieldWeight in 586, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.0390625 = fieldNorm(doc=586)
  0.25 = coord(1/4)
```
Abstract

Although the analysis of citations in the scholarly literature is now an established and relatively well understood part of information science, not enough is known about citations that can be found on the Web. In particular, are there new Web types, and if so, are these trivial or potentially useful for studying or evaluating research communication? We sought evidence based upon a sample of 1,577 Web citations of the URLs or titles of research articles in 64 open-access journals from biology, physics, chemistry, and computing. Only 25% represented intellectual impact, from references of Web documents (23%) and other informal scholarly sources (2%). Many of the Web/URL citations were created for general or subject-specific navigation (45%) or for self-publicity (22%). Additional analyses revealed significant disciplinary differences in the types of Google unique Web/URL citations as well as some characteristics of scientific open-access publishing on the Web. We conclude that the Web provides access to a new and different type of citation information, one that may therefore enable us to measure different aspects of research, and the research process in particular; but to obtain good information, the different types should be separated.

Search (11 results, page 1 of 1)

Authors

Languages

Themes