Search (59 results, page 2 of 3)

Vaughan, L.; Thelwall, M.: Scholarly use of the Web : what are the key inducers of links to journal Web sites? (2003) 0.04
```
0.036790654 = product of:
  0.091976635 = sum of:
    0.08158492 = weight(_text_:web in 1236) [ClassicSimilarity], result of:
      0.08158492 = score(doc=1236,freq=18.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.5408555 = fieldWeight in 1236, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1236)
    0.010391714 = product of:
      0.020783428 = sum of:
        0.020783428 = weight(_text_:research in 1236) [ClassicSimilarity], result of:
          0.020783428 = score(doc=1236,freq=2.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.15760657 = fieldWeight in 1236, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1236)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

Web links have been studied by information scientists for at least six years but it is only in the past two that clear evidence has emerged to show that counts of links to scholarly Web spaces (universities and departments) can correlate significantly with research measures, giving some credence to their use for the investigation of scholarly communication. This paper reports an a study to investigate the factors that influence the creation of links to journal Web sites. An empirical approach is used: collecting data and testing for significant patterns. The specific questions addressed are whether site age and site content are inducers of links to a journal's Web site as measured by the ratio of link counts to Journal Impact Factors, two variables previously discovered to be related. A new methodology for data collection is also introduced that uses the Internet Archive to obtain an earliest known creation date for Web sites. The results show that both site age and site content are significant factors for the disciplines studied: library and information science, and law. Comparisons between the two fields also show disciplinary differences in Web site characteristics. Scholars and publishers should be particularly aware that richer content an a journal's Web site tends to generate links and thus the traffic to the site.
Thelwall, M.; Vaughan, L.; Björneborn, L.: Webometrics (2004) 0.04
```
0.036790654 = product of:
  0.091976635 = sum of:
    0.08158492 = weight(_text_:web in 4279) [ClassicSimilarity], result of:
      0.08158492 = score(doc=4279,freq=18.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.5408555 = fieldWeight in 4279, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4279)
    0.010391714 = product of:
      0.020783428 = sum of:
        0.020783428 = weight(_text_:research in 4279) [ClassicSimilarity], result of:
          0.020783428 = score(doc=4279,freq=2.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.15760657 = fieldWeight in 4279, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4279)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

Webometrics, the quantitative study of Web-related phenomena, emerged from the realization that methods originally designed for bibliometric analysis of scientific journal article citation patterns could be applied to the Web, with commercial search engines providing the raw data. Almind and Ingwersen (1997) defined the field and gave it its name. Other pioneers included Rodriguez Gairin (1997) and Aguillo (1998). Larson (1996) undertook exploratory link structure analysis, as did Rousseau (1997). Webometrics encompasses research from fields beyond information science such as communication studies, statistical physics, and computer science. In this review we concentrate on link analysis, but also cover other aspects of webometrics, including Web log fle analysis. One theme that runs through this chapter is the messiness of Web data and the need for data cleansing heuristics. The uncontrolled Web creates numerous problems in the interpretation of results, for instance, from the automatic creation or replication of links. The loose connection between top-level domain specifications (e.g., com, edu, and org) and their actual content is also a frustrating problem. For example, many .com sites contain noncommercial content, although com is ostensibly the main commercial top-level domain. Indeed, a skeptical researcher could claim that obstacles of this kind are so great that all Web analyses lack value. As will be seen, one response to this view, a view shared by critics of evaluative bibliometrics, is to demonstrate that Web data correlate significantly with some non-Web data in order to prove that the Web data are not wholly random. A practical response has been to develop increasingly sophisticated data cleansing techniques and multiple data analysis methods.
Jepsen, E.T.; Seiden, P.; Ingwersen, P.; Björneborn, L.; Borlund, P.: Characteristics of scientific Web publications : preliminary data gathering and analysis (2004) 0.03
```
0.034924287 = product of:
  0.08731072 = sum of:
    0.076919004 = weight(_text_:web in 3091) [ClassicSimilarity], result of:
      0.076919004 = score(doc=3091,freq=16.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.5099235 = fieldWeight in 3091, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3091)
    0.010391714 = product of:
      0.020783428 = sum of:
        0.020783428 = weight(_text_:research in 3091) [ClassicSimilarity], result of:
          0.020783428 = score(doc=3091,freq=2.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.15760657 = fieldWeight in 3091, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3091)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

Because of the increasing presence of scientific publications an the Web, combined with the existing difficulties in easily verifying and retrieving these publications, research an techniques and methods for retrieval of scientific Web publications is called for. In this article, we report an the initial steps taken toward the construction of a test collection of scientific Web publications within the subject domain of plant biology. The steps reported are those of data gathering and data analysis aiming at identifying characteristics of scientific Web publications. The data used in this article were generated based an specifically selected domain topics that are searched for in three publicly accessible search engines (Google, AlITheWeb, and AItaVista). A sample of the retrieved hits was analyzed with regard to how various publication attributes correlated with the scientific quality of the content and whether this information could be employed to harvest, filter, and rank Web publications. The attributes analyzed were inlinks, outlinks, bibliographic references, file format, language, search engine overlap, structural position (according to site structure), and the occurrence of various types of metadata. As could be expected, the ranked output differs between the three search engines. Apparently, this is caused by differences in ranking algorithms rather than the databases themselves. In fact, because scientific Web content in this subject domain receives few inlinks, both AItaVista and AlITheWeb retrieved a higher degree of accessible scientific content than Google. Because of the search engine cutoffs of accessible URLs, the feasibility of using search engine output for Web content analysis is also discussed.

Thelwall, M.: ¬A comparison of sources of links for academic Web impact factor calculations (2002) 0.03

0.034746684 = product of:
  0.08686671 = sum of:
    0.065267935 = weight(_text_:web in 4474) [ClassicSimilarity], result of:
      0.065267935 = score(doc=4474,freq=8.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.43268442 = fieldWeight in 4474, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=4474)
    0.021598773 = product of:
      0.043197546 = sum of:
        0.043197546 = weight(_text_:research in 4474) [ClassicSimilarity], result of:
          0.043197546 = score(doc=4474,freq=6.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.3275791 = fieldWeight in 4474, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046875 = fieldNorm(doc=4474)
      0.5 = coord(1/2)
  0.4 = coord(2/5)

Abstract: There has been much recent interest in extracting information from collections of Web links. One tool that has been used is Ingwersen's Web impact factor. It has been demonstrated that several versions of this metric can produce results that correlate with research ratings of British universities showing that, despite being a measure of a purely Internet phenomenon, the results are susceptible to a wider interpretation. This paper addresses the question of which is the best possible domain to count backlinks from, if research is the focus of interest. WIFs for British universities calculated from several different source domains are compared, primarily the .edu, .ac.uk and .uk domains, and the entire Web. The results show that all four areas produce WIFs that correlate strongly with research ratings, but that none produce incontestably superior figures. It was also found that the WIF was less able to differentiate in more homogeneous subsets of universities, although positive results are still possible.

Thelwall, M.: Interpreting social science link analysis research : a theoretical framework (2006) 0.03

0.034746684 = product of:
  0.08686671 = sum of:
    0.065267935 = weight(_text_:web in 4908) [ClassicSimilarity], result of:
      0.065267935 = score(doc=4908,freq=8.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.43268442 = fieldWeight in 4908, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=4908)
    0.021598773 = product of:
      0.043197546 = sum of:
        0.043197546 = weight(_text_:research in 4908) [ClassicSimilarity], result of:
          0.043197546 = score(doc=4908,freq=6.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.3275791 = fieldWeight in 4908, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046875 = fieldNorm(doc=4908)
      0.5 = coord(1/2)
  0.4 = coord(2/5)

Abstract: Link analysis in various forms is now an established technique in many different subjects, reflecting the perceived importance of links and of the Web. A critical but very difficult issue is how to interpret the results of social science link analyses. lt is argued that the dynamic nature of the Web, its lack of quality control, and the online proliferation of copying and imitation mean that methodologies operating within a highly positivist, quantitative framework are ineffective. Conversely, the sheer variety of the Web makes application of qualitative methodologies and pure reason very problematic to large-scale studies. Methodology triangulation is consequently advocated, in combination with a warning that the Web is incapable of giving definitive answers to large-scale link analysis research questions concerning social factors underlying link creation. Finally, it is claimed that although theoretical frameworks are appropriate for guiding research, a Theory of Link Analysis is not possible.

Brody, T.; Harnad, S.; Carr, L.: Earlier Web usage statistics as predictors of later citation impact (2006) 0.03

0.03460754 = product of:
  0.086518854 = sum of:
    0.065944314 = weight(_text_:web in 165) [ClassicSimilarity], result of:
      0.065944314 = score(doc=165,freq=6.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.43716836 = fieldWeight in 165, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0546875 = fieldNorm(doc=165)
    0.020574544 = product of:
      0.041149087 = sum of:
        0.041149087 = weight(_text_:research in 165) [ClassicSimilarity], result of:
          0.041149087 = score(doc=165,freq=4.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.31204507 = fieldWeight in 165, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0546875 = fieldNorm(doc=165)
      0.5 = coord(1/2)
  0.4 = coord(2/5)

Abstract: The use of citation counts to assess the impact of research articles is well established. However, the citation impact of an article can only be measured several years after it has been published. As research articles are increasingly accessed through the Web, the number of times an article is downloaded can be instantly recorded and counted. One would expect the number of times an article is read to be related both to the number of times it is cited and to how old the article is. The authors analyze how short-term Web usage impact predicts medium-term citation impact. The physics e-print archive-arXiv.org-is used to test this.

Thelwall, M.: Results from a web impact factor crawler (2001) 0.03
```
0.027634421 = product of:
  0.06908605 = sum of:
    0.05438995 = weight(_text_:web in 4490) [ClassicSimilarity], result of:
      0.05438995 = score(doc=4490,freq=8.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.36057037 = fieldWeight in 4490, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4490)
    0.014696103 = product of:
      0.029392205 = sum of:
        0.029392205 = weight(_text_:research in 4490) [ClassicSimilarity], result of:
          0.029392205 = score(doc=4490,freq=4.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.22288933 = fieldWeight in 4490, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4490)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

Web impact factors, the proposed web equivalent of impact factors for journals, can be calculated by using search engines. It has been found that the results are problematic because of the variable coverage of search engines as well as their ability to give significantly different results over short periods of time. The fundamental problem is that although some search engines provide a functionality that is capable of being used for impact calculations, this is not their primary task and therefore they do not give guarantees as to performance in this respect. In this paper, a bespoke web crawler designed specifically for the calculation of reliable WIFs is presented. This crawler was used to calculate WIFs for a number of UK universities, and the results of these calculations are discussed. The principal findings were that with certain restrictions, WIFs can be calculated reliably, but do not correlate with accepted research rankings owing to the variety of material hosted on university servers. Changes to the calculations to improve the fit of the results to research rankings are proposed, but there are still inherent problems undermining the reliability of the calculation. These problems still apply if the WIF scores are taken on their own as indicators of the general impact of any area of the Internet, but with care would not apply to online journals.
Thelwall, M.: ¬A comparison of link and URL citation counting (2011) 0.02
```
0.022997916 = product of:
  0.05749479 = sum of:
    0.047103077 = weight(_text_:web in 4533) [ClassicSimilarity], result of:
      0.047103077 = score(doc=4533,freq=6.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.3122631 = fieldWeight in 4533, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4533)
    0.010391714 = product of:
      0.020783428 = sum of:
        0.020783428 = weight(_text_:research in 4533) [ClassicSimilarity], result of:
          0.020783428 = score(doc=4533,freq=2.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.15760657 = fieldWeight in 4533, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4533)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

Purpose - Link analysis is an established topic within webometrics. It normally uses counts of links between sets of web sites or to sets of web sites. These link counts are derived from web crawlers or commercial search engines with the latter being the only alternative for some investigations. This paper compares link counts with URL citation counts in order to assess whether the latter could be a replacement for the former if the major search engines withdraw their advanced hyperlink search facilities. Design/methodology/approach - URL citation counts are compared with link counts for a variety of data sets used in previous webometric studies. Findings - The results show a high degree of correlation between the two but with URL citations being much less numerous, at least outside academia and business. Research limitations/implications - The results cover a small selection of 15 case studies and so the findings are only indicative. Significant differences between results indicate that the difference between link counts and URL citation counts will vary between webometric studies. Practical implications - Should link searches be withdrawn, then link analyses of less well linked non-academic, non-commercial sites would be seriously weakened, although citations based on e-mail addresses could help to make citations more numerous than links for some business and academic contexts. Originality/value - This is the first systematic study of the difference between link counts and URL citation counts in a variety of contexts and it shows that there are significant differences between the two.
Ingwersen, P.: ¬The calculation of Web impact factors (1998) 0.02
```
0.022843778 = product of:
  0.11421889 = sum of:
    0.11421889 = weight(_text_:web in 1071) [ClassicSimilarity], result of:
      0.11421889 = score(doc=1071,freq=18.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.75719774 = fieldWeight in 1071, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1071)
  0.2 = coord(1/5)
```
Abstract

Reports investigations into the feasibility and reliability of calculating impact factors for web sites, called Web Impact Factors (Web-IF). analyzes a selection of 7 small and medium scale national and 4 large web domains as well as 6 institutional web sites over a series of snapshots taken of the web during a month. Describes the data isolation and calculation methods and discusses the tests. The results thus far demonstrate that Web-IFs are calculable with high confidence for national and sector domains whilst institutional Web-IFs should be approached with caution
Wouters, P.; Vries, R. de: Formally citing the Web (2004) 0.02
```
0.02278449 = product of:
  0.056961223 = sum of:
    0.04864785 = weight(_text_:web in 3093) [ClassicSimilarity], result of:
      0.04864785 = score(doc=3093,freq=10.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.32250395 = fieldWeight in 3093, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03125 = fieldNorm(doc=3093)
    0.008313371 = product of:
      0.016626742 = sum of:
        0.016626742 = weight(_text_:research in 3093) [ClassicSimilarity], result of:
          0.016626742 = score(doc=3093,freq=2.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.12608525 = fieldWeight in 3093, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.03125 = fieldNorm(doc=3093)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

How do authors refer to Web-based information sources in their formal scientific publications? It is not yet weIl known how scientists and scholars actually include new types of information sources, available through the new media, in their published work. This article reports an a comparative study of the lists of references in 38 scientific journals in five different scientific and social scientific fields. The fields are sociology, library and information science, biochemistry and biotechnology, neuroscience, and the mathematics of computing. As is weIl known, references, citations, and hyperlinks play different roles in academic publishing and communication. Our study focuses an hyperlinks as attributes of references in formal scholarly publications. The study developed and applied a method to analyze the differential roles of publishing media in the analysis of scientific and scholarly literature references. The present secondary databases that include reference and citation data (the Web of Science) cannot be used for this type of research. By the automated processing and analysis of the full text of scientific and scholarly articles, we were able to extract the references and hyperlinks contained in these references in relation to other features of the scientific and scholarly literature. Our findings show that hyperlinking references are indeed, as expected, abundantly present in the formal literature. They also tend to cite more recent literature than the average reference. The large majority of the references are to Web instances of traditional scientific journals. Other types of Web-based information sources are less weIl represented in the lists of references, except in the case of pure e-journals. We conclude that this can be explained by taking the role of the publisher into account. Indeed, it seems that the shift from print-based to electronic publishing has created new roles for the publisher. By shaping the way scientific references are hyperlinking to other information sources, the publisher may have a large impact an the availability of scientific and scholarly information.
Koehler, W.: Web page change and persistence : a four-year longitudinal study (2002) 0.02
```
0.021646928 = product of:
  0.10823464 = sum of:
    0.10823464 = weight(_text_:web in 203) [ClassicSimilarity], result of:
      0.10823464 = score(doc=203,freq=22.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.717526 = fieldWeight in 203, product of:
          4.690416 = tf(freq=22.0), with freq of:
            22.0 = termFreq=22.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=203)
  0.2 = coord(1/5)
```
Abstract

Changes in the topography of the Web can be expressed in at least four ways: (1) more sites on more servers in more places, (2) more pages and objects added to existing sites and pages, (3) changes in traffic, and (4) modifications to existing text, graphic, and other Web objects. This article does not address the first three factors (more sites, more pages, more traffic) in the growth of the Web. It focuses instead on changes to an existing set of Web documents. The article documents changes to an aging set of Web pages, first identified and "collected" in December 1996 and followed weekly thereafter. Results are reported through February 2001. The article addresses two related phenomena: (1) the life cycle of Web objects, and (2) changes to Web objects. These data reaffirm that the half-life of a Web page is approximately 2 years. There is variation among Web pages by top-level domain and by page type (navigation, content). Web page content appears to stabilize over time; aging pages change less often than once they did
Park, H.W.; Barnett, G.A.; Nam, I.-Y.: Hyperlink - affiliation network structure of top Web sites : examining affiliates with hyperlink in Korea (2002) 0.02
```
0.021537323 = product of:
  0.10768661 = sum of:
    0.10768661 = weight(_text_:web in 584) [ClassicSimilarity], result of:
      0.10768661 = score(doc=584,freq=16.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.71389294 = fieldWeight in 584, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0546875 = fieldNorm(doc=584)
  0.2 = coord(1/5)
```
Abstract

This article argues that individual Web sites form hyperlink-affiliations with others for the purpose of strengthening their individual trust, expertness, and safety. It describes the hyperlink-affiliation network structure of Korea's top 152 Web sites. The data were obtained from their Web sites for October 2000. The results indicate that financial Web sites, such as credit card and stock Web sites, occupy the most central position in the network. A cluster analysis reveals that the structure of the hyperlink-affiliation network is influenced by the financial Web sites with which others are affiliated. These findings are discussed from the perspective of Web site credibility.

Youngen, G.K.: Citation patterns to traditional and electronic preprints in the published literature (1998) 0.02

0.018041609 = product of:
  0.045104023 = sum of:
    0.032633968 = weight(_text_:web in 3360) [ClassicSimilarity], result of:
      0.032633968 = score(doc=3360,freq=2.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.21634221 = fieldWeight in 3360, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=3360)
    0.012470056 = product of:
      0.024940113 = sum of:
        0.024940113 = weight(_text_:research in 3360) [ClassicSimilarity], result of:
          0.024940113 = score(doc=3360,freq=2.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.18912788 = fieldWeight in 3360, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046875 = fieldNorm(doc=3360)
      0.5 = coord(1/2)
  0.4 = coord(2/5)

Abstract: The number of physics and astronomy preprints (manuscripts intended for publication but circulated for peer comment prior to submission) available electronically has increased dramatically over the past 5 years and Internet accessible preprint Web servers at the Stanford Accelerator Laboratory (SLAC) and the Los Alamos National Laboratoty (LANL) provide unrestricted access to citations and full text of many of these papers long before they appear in print. Includes data for periodicals ranked by number of citations to preprints and electronic preprints (e-prints). Identifies the growing importance of e-prints in the published literature and addresses areas of concern regarding their future role in scientific communication, including: inclusion of e-prints in abstracting and indexing services; connecting electronic periodicals with e-prints; guidelines for withdrawal and revision of e-prints; and maintaining the integritiy of the e-print servers
Source: College and research libraries. 59(1998) no.5, S.448-456

Menczer, F.: Lexical and semantic clustering by Web links (2004) 0.02
```
0.017268276 = product of:
  0.08634137 = sum of:
    0.08634137 = weight(_text_:web in 3090) [ClassicSimilarity], result of:
      0.08634137 = score(doc=3090,freq=14.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.57238775 = fieldWeight in 3090, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=3090)
  0.2 = coord(1/5)
```
Abstract

Recent Web-searching and -mining tools are combining text and link analysis to improve ranking and crawling algorithms. The central assumption behind such approaches is that there is a correiation between the graph structure of the Web and the text and meaning of pages. Here I formalize and empirically evaluate two general conjectures drawing connections from link information to lexical and semantic Web content. The link-content conjecture states that a page is similar to the pages that link to it, and the link-cluster conjecture that pages about the same topic are clustered together. These conjectures are offen simply assumed to hold, and Web search tools are built an such assumptions. The present quantitative confirmation sheds light an the connection between the success of the latest Web-mining techniques and the small world topology of the Web, with encouraging implications for the design of better crawling algorithms.
Vaughan, L.; Shaw, D.: Web citation data for impact assessment : a comparison of four science disciplines (2005) 0.02
```
0.017199611 = product of:
  0.08599806 = sum of:
    0.08599806 = weight(_text_:web in 3880) [ClassicSimilarity], result of:
      0.08599806 = score(doc=3880,freq=20.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.5701118 = fieldWeight in 3880, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3880)
  0.2 = coord(1/5)
```
Abstract

The number and type of Web citations to journal articles in four areas of science are examined: biology, genetics, medicine, and multidisciplinary sciences. For a sample of 5,972 articles published in 114 journals, the median Web citation counts per journal article range from 6.2 in medicine to 10.4 in genetics. About 30% of Web citations in each area indicate intellectual impact (citations from articles or class readings, in contrast to citations from bibliographic services or the author's or journal's home page). Journals receiving more Web citations also have higher percentages of citations indicating intellectual impact. There is significant correlation between the number of citations reported in the databases from the Institute for Scientific Information (ISI, now Thomson Scientific) and the number of citations retrieved using the Google search engine (Web citations). The correlation is much weaker for journals published outside the United Kingdom or United States and for multidisciplinary journals. Web citation numbers are higher than ISI citation counts, suggesting that Web searches might be conducted for an earlier or a more fine-grained assessment of an article's impact. The Web-evident impact of non-UK/USA publications might provide a balance to the geographic or cultural biases observed in ISI's data, although the stability of Web citation counts is debatable.
Faba-Pérez, C.; Zapico-Alonso, F.; Guerrero-Bote, V.P.; Moya-Anegón, F. de: Comparative analysis of webometric measurements in thematic environments (2005) 0.02
```
0.015229187 = product of:
  0.07614593 = sum of:
    0.07614593 = weight(_text_:web in 3554) [ClassicSimilarity], result of:
      0.07614593 = score(doc=3554,freq=8.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.50479853 = fieldWeight in 3554, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0546875 = fieldNorm(doc=3554)
  0.2 = coord(1/5)
```
Abstract

There have been many attempts to evaluate Web spaces an the basis of the information that they provide, their form or functionality, or even the importance given to each of them by the Web itself. The indicators that have been developed for this purpose fall into two groups: those based an the study of a Web space's formal characteristics, and those related to its link structure. In this study we examine most of the webometric indicators that have been proposed in the literature together with others of our own design by applying them to a set of thematically related Web spaces and analyzing the relationships between the different indicators.

Cronin, B.: Bibliometrics and beyond : some thoughts on web-based citation analysis (2001) 0.02

0.015229187 = product of:
  0.07614593 = sum of:
    0.07614593 = weight(_text_:web in 3890) [ClassicSimilarity], result of:
      0.07614593 = score(doc=3890,freq=2.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.50479853 = fieldWeight in 3890, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.109375 = fieldNorm(doc=3890)
  0.2 = coord(1/5)

Thelwall, M.; Sud, P.: ¬A comparison of methods for collecting web citation data for academic organizations (2011) 0.02
```
0.015034676 = product of:
  0.03758669 = sum of:
    0.027194975 = weight(_text_:web in 4626) [ClassicSimilarity], result of:
      0.027194975 = score(doc=4626,freq=2.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.18028519 = fieldWeight in 4626, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4626)
    0.010391714 = product of:
      0.020783428 = sum of:
        0.020783428 = weight(_text_:research in 4626) [ClassicSimilarity], result of:
          0.020783428 = score(doc=4626,freq=2.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.15760657 = fieldWeight in 4626, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4626)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

The primary webometric method for estimating the online impact of an organization is to count links to its website. Link counts have been available from commercial search engines for over a decade but this was set to end by early 2012 and so a replacement is needed. This article compares link counts to two alternative methods: URL citations and organization title mentions. New variations of these methods are also introduced. The three methods are compared against each other using Yahoo!. Two of the three methods (URL citations and organization title mentions) are also compared against each other using Bing. Evidence from a case study of 131 UK universities and 49 US Library and Information Science (LIS) departments suggests that Bing's Hit Count Estimates (HCEs) for popular title searches are not useful for webometric research but that Yahoo!'s HCEs for all three types of search and Bing's URL citation HCEs seem to be consistent. For exact URL counts the results of all three methods in Yahoo! and both methods in Bing are also consistent. Four types of accuracy factors are also introduced and defined: search engine coverage, search engine retrieval variation, search engine retrieval anomalies, and query polysemy.
Thelwall, M.; Wilkinson, D.: Finding similar academic Web sites with links, bibliometric couplings and colinks (2004) 0.01
```
0.014594356 = product of:
  0.072971776 = sum of:
    0.072971776 = weight(_text_:web in 2571) [ClassicSimilarity], result of:
      0.072971776 = score(doc=2571,freq=10.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.48375595 = fieldWeight in 2571, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=2571)
  0.2 = coord(1/5)
```
Abstract

A common task in both Webmetrics and Web information retrieval is to identify a set of Web pages or sites that are similar in content. In this paper we assess the extent to which links, colinks and couplings can be used to identify similar Web sites. As an experiment, a random sample of 500 pairs of domains from the UK academic Web were taken and human assessments of site similarity, based upon content type, were compared against ratings for the three concepts. The results show that using a combination of all three gives the highest probability of identifying similar sites, but surprisingly this was only a marginal improvement over using links alone. Another unexpected result was that high values for either colink counts or couplings were associated with only a small increased likelihood of similarity. The principal advantage of using couplings and colinks was found to be greater coverage in terms of a much larger number of pairs of sites being connected by these measures, instead of increased probability of similarity. In information retrieval terminology, this is improved recall rather than improved precision.
Prime-Claverie, C.; Beigbeder, M.; Lafouge, T.: Transposition of the cocitation method with a view to classifying Web pages (2004) 0.01
```
0.014594356 = product of:
  0.072971776 = sum of:
    0.072971776 = weight(_text_:web in 3095) [ClassicSimilarity], result of:
      0.072971776 = score(doc=3095,freq=10.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.48375595 = fieldWeight in 3095, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=3095)
  0.2 = coord(1/5)
```
Abstract

The Web is a huge source of information, and one of the main problems facing users is finding documents which correspond to their requirements. Apart from the problem of thematic relevance, the documents retrieved by search engines do not always meet the users' expectations. The document may be too general, or conversely too specialized, or of a different type from what the user is looking for, and so forth. We think that adding metadata to pages can considerably improve the process of searching for information an the Web. This article presents a possible typology for Web sites and pages, as weIl as a method for propagating metadata values, based an the study of the Web graph and more specifically the method of cocitation in this graph.

Search (59 results, page 2 of 3)

Authors

Years

Languages

Types

Themes

Subjects

Classifications