Search (12 results, page 1 of 1)

Thelwall, M.: A comparison of link and URL citation counting (2011) 0.07
```
0.06923047 = product of:
  0.13846093 = sum of:
    0.1204711 = weight(_text_:sites in 4533) [ClassicSimilarity], result of:
      0.1204711 = score(doc=4533,freq=6.0), product of:
        0.2408473 = queryWeight, product of:
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.046071928 = queryNorm
        0.500197 = fieldWeight in 4533, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4533)
    0.01798983 = product of:
      0.03597966 = sum of:
        0.03597966 = weight(_text_:design in 4533) [ClassicSimilarity], result of:
          0.03597966 = score(doc=4533,freq=2.0), product of:
            0.17322445 = queryWeight, product of:
              3.7598698 = idf(docFreq=2798, maxDocs=44218)
              0.046071928 = queryNorm
            0.20770542 = fieldWeight in 4533, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7598698 = idf(docFreq=2798, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4533)
      0.5 = coord(1/2)
  0.5 = coord(2/4)
```
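The score breakdown above can be checked by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))). The function and variable names are illustrative and not part of the search system; the input numbers are copied from the explain tree for doc 4533.

```python
import math

# Values copied from the explain output for doc 4533.
QUERY_NORM = 0.046071928
MAX_DOCS = 44218
FIELD_NORM = 0.0390625


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as defined by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, max_docs: int,
               query_norm: float, field_norm: float) -> float:
    """Score of one query term: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)                        # tf(freq)
    term_idf = idf(doc_freq, max_docs)          # idf(docFreq, maxDocs)
    query_weight = term_idf * query_norm        # queryWeight
    field_weight = tf * term_idf * field_norm   # fieldWeight
    return query_weight * field_weight


# "sites" occurs 6 times in the document and in 644 documents overall.
sites = term_score(6.0, 644, MAX_DOCS, QUERY_NORM, FIELD_NORM)
# "design" occurs twice in the document and in 2798 documents overall.
design = term_score(2.0, 2798, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# "design" sits in a nested clause with coord(1/2); the outer query
# matched 2 of its 4 clauses, hence coord(2/4).
total = (sites + design * 0.5) * 0.5

print(f"sites  = {sites:.7f}")
print(f"design = {design:.7f}")
print(f"total  = {total:.8f}")
```

The printed total should land close to the 0.06923047 reported above. Small differences in the last digits are expected, because Lucene computes with 32-bit floats while Python uses 64-bit floats.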
Abstract

Purpose - Link analysis is an established topic within webometrics. It normally uses counts of links between sets of web sites or to sets of web sites. These link counts are derived from web crawlers or commercial search engines with the latter being the only alternative for some investigations. This paper compares link counts with URL citation counts in order to assess whether the latter could be a replacement for the former if the major search engines withdraw their advanced hyperlink search facilities. Design/methodology/approach - URL citation counts are compared with link counts for a variety of data sets used in previous webometric studies. Findings - The results show a high degree of correlation between the two but with URL citations being much less numerous, at least outside academia and business. Research limitations/implications - The results cover a small selection of 15 case studies and so the findings are only indicative. Significant differences between results indicate that the difference between link counts and URL citation counts will vary between webometric studies. Practical implications - Should link searches be withdrawn, then link analyses of less well linked non-academic, non-commercial sites would be seriously weakened, although citations based on e-mail addresses could help to make citations more numerous than links for some business and academic contexts. Originality/value - This is the first systematic study of the difference between link counts and URL citation counts in a variety of contexts and it shows that there are significant differences between the two.

Angus, E.; Thelwall, M.; Stuart, D.: General patterns of tag usage among university groups in Flickr (2008) 0.05
```
0.05252631 = product of:
  0.10505262 = sum of:
    0.08346482 = weight(_text_:sites in 2554) [ClassicSimilarity], result of:
      0.08346482 = score(doc=2554,freq=2.0), product of:
        0.2408473 = queryWeight, product of:
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.046071928 = queryNorm
        0.34654665 = fieldWeight in 2554, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.046875 = fieldNorm(doc=2554)
    0.021587795 = product of:
      0.04317559 = sum of:
        0.04317559 = weight(_text_:design in 2554) [ClassicSimilarity], result of:
          0.04317559 = score(doc=2554,freq=2.0), product of:
            0.17322445 = queryWeight, product of:
              3.7598698 = idf(docFreq=2798, maxDocs=44218)
              0.046071928 = queryNorm
            0.24924651 = fieldWeight in 2554, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7598698 = idf(docFreq=2798, maxDocs=44218)
              0.046875 = fieldNorm(doc=2554)
      0.5 = coord(1/2)
  0.5 = coord(2/4)
```

Abstract

Purpose - The purpose of this research is to investigate general patterns of tag usage and determine the usefulness of the tags used within university image groups to the wider Flickr community. There has been a significant rise in the use of Web 2.0 social network web sites and online applications in recent years. One of the most popular is Flickr, an online image management application. Design/methodology/approach - This study uses a webometric data collection, classification and informetric analysis. Findings - The results show that members of university image groups tend to tag in a manner that is of use to users of the system as a whole rather than merely for the tag creator. Originality/value - This paper gives a valuable insight into the tagging practices of image groups in Flickr.

Thelwall, M.; Wilkinson, D.: Finding similar academic Web sites with links, bibliometric couplings and colinks (2004) 0.05
```
0.04665825 = product of:
  0.186633 = sum of:
    0.186633 = weight(_text_:sites in 2571) [ClassicSimilarity], result of:
      0.186633 = score(doc=2571,freq=10.0), product of:
        0.2408473 = queryWeight, product of:
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.046071928 = queryNorm
        0.7749018 = fieldWeight in 2571, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.046875 = fieldNorm(doc=2571)
  0.25 = coord(1/4)
```
Abstract

A common task in both Webometrics and Web information retrieval is to identify a set of Web pages or sites that are similar in content. In this paper we assess the extent to which links, colinks and couplings can be used to identify similar Web sites. As an experiment, a random sample of 500 pairs of domains from the UK academic Web was taken and human assessments of site similarity, based upon content type, were compared against ratings for the three concepts. The results show that using a combination of all three gives the highest probability of identifying similar sites, but surprisingly this was only a marginal improvement over using links alone. Another unexpected result was that high values for either colink counts or couplings were associated with only a small increased likelihood of similarity. The principal advantage of using couplings and colinks was found to be greater coverage in terms of a much larger number of pairs of sites being connected by these measures, instead of increased probability of similarity. In information retrieval terminology, this is improved recall rather than improved precision.
Thelwall, M.: Conceptualizing documentation on the Web : an evaluation of different heuristic-based models for counting links between university Web sites (2002) 0.03
```
0.034777008 = product of:
  0.13910803 = sum of:
    0.13910803 = weight(_text_:sites in 978) [ClassicSimilarity], result of:
      0.13910803 = score(doc=978,freq=8.0), product of:
        0.2408473 = queryWeight, product of:
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.046071928 = queryNorm
        0.5775777 = fieldWeight in 978, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.0390625 = fieldNorm(doc=978)
  0.25 = coord(1/4)
```
Abstract

All known previous Web link studies have used the Web page as the primary indivisible source document for counting purposes. Arguments are presented to explain why this is not necessarily optimal and why other alternatives have the potential to produce better results. This is despite the fact that individual Web files are often the only choice if search engines are used for raw data and are the easiest basic Web unit to identify. The central issue is of defining the Web "document": that which should comprise the single indissoluble unit of coherent material. Three alternative heuristics are defined for the educational arena based upon the directory, the domain and the whole university site. These are then compared by implementing them on a set of 108 UK university institutional Web sites under the assumption that a more effective heuristic will tend to produce results that correlate more highly with institutional research productivity. It was discovered that the domain and directory models were able to successfully reduce the impact of anomalous linking behavior between pairs of Web sites, with the latter being the method of choice. Reasons are then given as to why a document model on its own cannot eliminate all anomalies in Web linking behavior. Finally, the results from all models give a clear confirmation of the very strong association between the research productivity of a UK university and the number of incoming links from its peers' Web sites.
Vaughan, L.; Thelwall, M.: Scholarly use of the Web : what are the key inducers of links to journal Web sites? (2003) 0.03
```
0.030117774 = product of:
  0.1204711 = sum of:
    0.1204711 = weight(_text_:sites in 1236) [ClassicSimilarity], result of:
      0.1204711 = score(doc=1236,freq=6.0), product of:
        0.2408473 = queryWeight, product of:
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.046071928 = queryNorm
        0.500197 = fieldWeight in 1236, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1236)
  0.25 = coord(1/4)
```
Abstract

Web links have been studied by information scientists for at least six years but it is only in the past two that clear evidence has emerged to show that counts of links to scholarly Web spaces (universities and departments) can correlate significantly with research measures, giving some credence to their use for the investigation of scholarly communication. This paper reports on a study to investigate the factors that influence the creation of links to journal Web sites. An empirical approach is used: collecting data and testing for significant patterns. The specific questions addressed are whether site age and site content are inducers of links to a journal's Web site as measured by the ratio of link counts to Journal Impact Factors, two variables previously discovered to be related. A new methodology for data collection is also introduced that uses the Internet Archive to obtain an earliest known creation date for Web sites. The results show that both site age and site content are significant factors for the disciplines studied: library and information science, and law. Comparisons between the two fields also show disciplinary differences in Web site characteristics. Scholars and publishers should be particularly aware that richer content on a journal's Web site tends to generate links and thus the traffic to the site.
Thelwall, M.; Wilkinson, D.: Graph structure in three national academic Webs : power laws with anomalies (2003) 0.03
```
0.029509272 = product of:
  0.11803709 = sum of:
    0.11803709 = weight(_text_:sites in 1681) [ClassicSimilarity], result of:
      0.11803709 = score(doc=1681,freq=4.0), product of:
        0.2408473 = queryWeight, product of:
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.046071928 = queryNorm
        0.49009097 = fieldWeight in 1681, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.046875 = fieldNorm(doc=1681)
  0.25 = coord(1/4)
```
Abstract

The graph structures of three national university publicly indexable Webs from Australia, New Zealand, and the UK were analyzed. Strong scale-free regularities for page indegrees, outdegrees, and connected component sizes were in evidence, resulting in power laws similar to those previously identified for individual university Web sites and for the AltaVista-indexed Web. Anomalies were also discovered in most distributions and were tracked down to root causes. As a result, resource driven Web sites and automatically generated pages were identified as representing a significant break from the assumptions of previous power law models. It follows that attempts to track average Web linking behavior would benefit from using techniques to minimize or eliminate the impact of such anomalies.
Thelwall, M.; Kousha, K.: Academia.edu : Social network or Academic Network? (2014) 0.02
```
0.024591058 = product of:
  0.098364234 = sum of:
    0.098364234 = weight(_text_:sites in 1234) [ClassicSimilarity], result of:
      0.098364234 = score(doc=1234,freq=4.0), product of:
        0.2408473 = queryWeight, product of:
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.046071928 = queryNorm
        0.40840912 = fieldWeight in 1234, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1234)
  0.25 = coord(1/4)
```
Abstract

Academic social network sites Academia.edu and ResearchGate, and reference sharing sites Mendeley, Bibsonomy, Zotero, and CiteULike, give scholars the ability to publicize their research outputs and connect with each other. With millions of users, these are a significant addition to the scholarly communication and academic information-seeking eco-structure. There is thus a need to understand the role that they play and the changes, if any, that they can make to the dynamics of academic careers. This article investigates attributes of philosophy scholars on Academia.edu, introducing a median-based, time-normalizing method to adjust for time delays in joining the site. In comparison to students, faculty tend to attract more profile views but female philosophers did not attract more profile views than did males, suggesting that academic capital drives philosophy uses of the site more than does friendship and networking. Secondary analyses of law, history, and computer science confirmed the faculty advantage (in terms of higher profile views) except for females in law and females in computer science. There was also a female advantage for both faculty and students in law and computer science as well as for history students. Hence, Academia.edu overall seems to reflect a hybrid of scholarly norms (the faculty advantage) and a female advantage that is suggestive of general social networking norms. Finally, traditional bibliometric measures did not correlate with any Academia.edu metrics for philosophers, perhaps because more senior academics use the site less extensively or because of the range of informal scholarly activities that cannot be measured by bibliometric methods.
Thelwall, M.: Homophily in MySpace (2009) 0.02
```
0.020866206 = product of:
  0.08346482 = sum of:
    0.08346482 = weight(_text_:sites in 2706) [ClassicSimilarity], result of:
      0.08346482 = score(doc=2706,freq=2.0), product of:
        0.2408473 = queryWeight, product of:
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.046071928 = queryNorm
        0.34654665 = fieldWeight in 2706, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.046875 = fieldNorm(doc=2706)
  0.25 = coord(1/4)
```
Abstract

Social network sites like MySpace are increasingly important environments for expressing and maintaining interpersonal connections, but does online communication exacerbate or ameliorate the known tendency for offline friendships to form between similar people (homophily)? This article reports an exploratory study of the similarity between the reported attributes of pairs of active MySpace Friends based upon a systematic sample of 2,567 members joining on June 18, 2007 and Friends who commented on their profile. The results showed no evidence of gender homophily but significant evidence of homophily for ethnicity, religion, age, country, marital status, attitude towards children, sexual orientation, and reason for joining MySpace. There were also some imbalances: women and the young were disproportionately commenters, and commenters tended to have more Friends than commentees. Overall, it seems that although traditional sources of homophily are thriving in MySpace networks of active public connections, gender homophily has completely disappeared. Finally, the method used has wide potential for investigating and partially tracking homophily in society, providing early warning of socially divisive trends.
Thelwall, M.: Webometrics (2009) 0.02
```
0.020866206 = product of:
  0.08346482 = sum of:
    0.08346482 = weight(_text_:sites in 3906) [ClassicSimilarity], result of:
      0.08346482 = score(doc=3906,freq=2.0), product of:
        0.2408473 = queryWeight, product of:
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.046071928 = queryNorm
        0.34654665 = fieldWeight in 3906, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.046875 = fieldNorm(doc=3906)
  0.25 = coord(1/4)
```
Abstract

Webometrics is an information science field concerned with measuring aspects of the World Wide Web (WWW) for a variety of information science research goals. It came into existence about five years after the Web was formed and has since grown to become a significant aspect of information science, at least in terms of published research. Although some webometrics research has focused on the structure or evolution of the Web itself or the performance of commercial search engines, most has used data from the Web to shed light on information provision or online communication in various contexts. Most prominently, techniques have been developed to track, map, and assess Web-based informal scholarly communication, for example, in terms of the hyperlinks between academic Web sites or the online impact of digital repositories. In addition, a range of nonacademic issues and groups of Web users have also been analyzed.
Thelwall, M.; Vaughan, L.; Björneborn, L.: Webometrics (2004) 0.02
```
0.017388504 = product of:
  0.069554016 = sum of:
    0.069554016 = weight(_text_:sites in 4279) [ClassicSimilarity], result of:
      0.069554016 = score(doc=4279,freq=2.0), product of:
        0.2408473 = queryWeight, product of:
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.046071928 = queryNorm
        0.28878886 = fieldWeight in 4279, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.227637 = idf(docFreq=644, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4279)
  0.25 = coord(1/4)
```
Abstract

Webometrics, the quantitative study of Web-related phenomena, emerged from the realization that methods originally designed for bibliometric analysis of scientific journal article citation patterns could be applied to the Web, with commercial search engines providing the raw data. Almind and Ingwersen (1997) defined the field and gave it its name. Other pioneers included Rodriguez Gairin (1997) and Aguillo (1998). Larson (1996) undertook exploratory link structure analysis, as did Rousseau (1997). Webometrics encompasses research from fields beyond information science such as communication studies, statistical physics, and computer science. In this review we concentrate on link analysis, but also cover other aspects of webometrics, including Web log file analysis. One theme that runs through this chapter is the messiness of Web data and the need for data cleansing heuristics. The uncontrolled Web creates numerous problems in the interpretation of results, for instance, from the automatic creation or replication of links. The loose connection between top-level domain specifications (e.g., com, edu, and org) and their actual content is also a frustrating problem. For example, many .com sites contain noncommercial content, although com is ostensibly the main commercial top-level domain. Indeed, a skeptical researcher could claim that obstacles of this kind are so great that all Web analyses lack value. As will be seen, one response to this view, a view shared by critics of evaluative bibliometrics, is to demonstrate that Web data correlate significantly with some non-Web data in order to prove that the Web data are not wholly random. A practical response has been to develop increasingly sophisticated data cleansing techniques and multiple data analysis methods.

Thelwall, M.; Ruschenburg, T.: Grundlagen und Forschungsfelder der Webometrie (2006) 0.01
```
0.0062421104 = product of:
  0.024968442 = sum of:
    0.024968442 = product of:
      0.049936883 = sum of:
        0.049936883 = weight(_text_:22 in 77) [ClassicSimilarity], result of:
          0.049936883 = score(doc=77,freq=2.0), product of:
            0.16133605 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046071928 = queryNorm
            0.30952093 = fieldWeight in 77, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=77)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```

Date: 4.12.2006 12:12:22

Thelwall, M.; Buckley, K.; Paltoglou, G.: Sentiment in Twitter events (2011) 0.00
```
0.0046815826 = product of:
  0.01872633 = sum of:
    0.01872633 = product of:
      0.03745266 = sum of:
        0.03745266 = weight(_text_:22 in 4345) [ClassicSimilarity], result of:
          0.03745266 = score(doc=4345,freq=2.0), product of:
            0.16133605 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046071928 = queryNorm
            0.23214069 = fieldWeight in 4345, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=4345)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```

Date: 22. 1.2011 14:27:06
