Search (43 results, page 1 of 3)

Thelwall, M.: Webometrics (2009) 0.10

0.10085578 = product of:
  0.16809297 = sum of:
    0.060152818 = weight(_text_:wide in 3906) [ClassicSimilarity], result of:
      0.060152818 = score(doc=3906,freq=2.0), product of:
        0.20479609 = queryWeight, product of:
          4.4307585 = idf(docFreq=1430, maxDocs=44218)
          0.046221454 = queryNorm
        0.29372054 = fieldWeight in 3906, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.4307585 = idf(docFreq=1430, maxDocs=44218)
          0.046875 = fieldNorm(doc=3906)
    0.08634137 = weight(_text_:web in 3906) [ClassicSimilarity], result of:
      0.08634137 = score(doc=3906,freq=14.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.57238775 = fieldWeight in 3906, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=3906)
    0.021598773 = product of:
      0.043197546 = sum of:
        0.043197546 = weight(_text_:research in 3906) [ClassicSimilarity], result of:
          0.043197546 = score(doc=3906,freq=6.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.3275791 = fieldWeight in 3906, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046875 = fieldNorm(doc=3906)
      0.5 = coord(1/2)
  0.6 = coord(3/5)

Abstract: Webometrics is an information science field concerned with measuring aspects of the World Wide Web (WWW) for a variety of information science research goals. It came into existence about five years after the Web was formed and has since grown to become a significant aspect of information science, at least in terms of published research. Although some webometrics research has focused on the structure or evolution of the Web itself or the performance of commercial search engines, most has used data from the Web to shed light on information provision or online communication in various contexts. Most prominently, techniques have been developed to track, map, and assess Web-based informal scholarly communication, for example, in terms of the hyperlinks between academic Web sites or the online impact of digital repositories. In addition, a range of nonacademic issues and groups of Web users have also been analyzed.

Bar-Ilan, J.; Peritz, B.C.: Informetric theories and methods for exploring the Internet : an analytical survey of recent research literature (2002) 0.07

0.070636146 = product of:
  0.1177269 = sum of:
    0.060152818 = weight(_text_:wide in 813) [ClassicSimilarity], result of:
      0.060152818 = score(doc=813,freq=2.0), product of:
        0.20479609 = queryWeight, product of:
          4.4307585 = idf(docFreq=1430, maxDocs=44218)
          0.046221454 = queryNorm
        0.29372054 = fieldWeight in 813, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.4307585 = idf(docFreq=1430, maxDocs=44218)
          0.046875 = fieldNorm(doc=813)
    0.032633968 = weight(_text_:web in 813) [ClassicSimilarity], result of:
      0.032633968 = score(doc=813,freq=2.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.21634221 = fieldWeight in 813, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=813)
    0.024940113 = product of:
      0.049880225 = sum of:
        0.049880225 = weight(_text_:research in 813) [ClassicSimilarity], result of:
          0.049880225 = score(doc=813,freq=8.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.37825575 = fieldWeight in 813, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046875 = fieldNorm(doc=813)
      0.5 = coord(1/2)
  0.6 = coord(3/5)

Abstract: The Internet, and more specifically the World Wide Web, is quickly becoming one of our main information sources. Systematic evaluation and analysis can help us understand how this medium works, grows, and changes, and how it influences our lives and research. New approaches in informetrics can provide an appropriate means towards achieving the above goals, and towards establishing a sound theory. This paper presents a selective review of research based on the Internet, using bibliometric and informetric methods and tools. Some of these studies clearly show the applicability of bibliometric laws to the Internet, while others establish new definitions and methods based on the respective definitions for printed sources. Both informetrics and Internet research can gain from these additional methods.

Cothey, V.: Web-crawling reliability (2004) 0.07

0.06836396 = product of:
  0.1709099 = sum of:
    0.07017829 = weight(_text_:wide in 3089) [ClassicSimilarity], result of:
      0.07017829 = score(doc=3089,freq=2.0), product of:
        0.20479609 = queryWeight, product of:
          4.4307585 = idf(docFreq=1430, maxDocs=44218)
          0.046221454 = queryNorm
        0.342674 = fieldWeight in 3089, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.4307585 = idf(docFreq=1430, maxDocs=44218)
          0.0546875 = fieldNorm(doc=3089)
    0.100731604 = weight(_text_:web in 3089) [ClassicSimilarity], result of:
      0.100731604 = score(doc=3089,freq=14.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.6677857 = fieldWeight in 3089, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0546875 = fieldNorm(doc=3089)
  0.4 = coord(2/5)

Abstract: In this article, I investigate the reliability, in the social science sense, of collecting informetric data about the World Wide Web by Web crawling. The investigation includes a critical examination of the practice of Web crawling and contrasts the results of content crawling with the results of link crawling. It is shown that Web crawling by search engines is intentionally biased and selective. I also report the results of a [arge-scale experimental simulation of Web crawling that illustrates the effects of different crawling policies an data collection. It is concluded that the reliability of Web crawling as a data collection technique is improved by fuller reporting of relevant crawling policies.

fwt: Webseiten liegen im Schnitt nur 19 Klicks auseinander (2001) 0.06
```
0.056637056 = product of:
  0.14159264 = sum of:
    0.08506894 = weight(_text_:wide in 5962) [ClassicSimilarity], result of:
      0.08506894 = score(doc=5962,freq=4.0), product of:
        0.20479609 = queryWeight, product of:
          4.4307585 = idf(docFreq=1430, maxDocs=44218)
          0.046221454 = queryNorm
        0.4153836 = fieldWeight in 5962, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.4307585 = idf(docFreq=1430, maxDocs=44218)
          0.046875 = fieldNorm(doc=5962)
    0.056523696 = weight(_text_:web in 5962) [ClassicSimilarity], result of:
      0.056523696 = score(doc=5962,freq=6.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.37471575 = fieldWeight in 5962, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=5962)
  0.4 = coord(2/5)
```
Abstract

"Dokumente im World Wide Web liegen durchschnittlich 19 Mausklicks voneinander entfernt - angesichts von schätzungsweise mehr als einer Milliarde Seiten erstaunlich nahe. Albert-Lazlo Barabai vom Institut für Physik der University von Notre Dame (US-Staat Indiana) stellt seine Studie in der britischen Fachzeitschrift Physics World (Juli 2001, S. 33) vor. Der Statistiker konstruierte im Rechner zunächst Modelle von großen Computernetzwerken. Grundlage für diese Abbilder war die Analyse eines kleinen Teils der Verbindungen im Web, die der Wissenschaftler automatisch von einem Programm hatte prüfen lassen. Um seine Ergebnisse zu erklären, vergleicht Barabai das World Wide Web mit den Verbindungen internationaler Fluglinien. Dort gebe es zahlreiche Flughäfen, die meist nur mit anderen Flugplätzen in ihrer näheren Umgebung in Verbindung stünden. Diese kleineren Verteiler stehen ihrerseits mit einigen wenigen großen Airports wie Frankfurt, New York oder Hongkong in Verbindung. Ähnlich sei es im Netz, wo wenige große Server die Verteilung großer Datenmengen übernähmen und weite Entfernungen überbrückten. Damit seien die Online-Wege vergleichsweise kurz. Die Untersuchung spiegelt allerdings die Situation des Jahres 1999 wider. Seinerzeit gab es vermutlich 800 Millionen Knoten."

Thelwall, M.; Vaughan, L.: Webometrics : an introduction to the special issue (2004) 0.05

0.04948629 = product of:
  0.12371573 = sum of:
    0.080203764 = weight(_text_:wide in 2908) [ClassicSimilarity], result of:
      0.080203764 = score(doc=2908,freq=2.0), product of:
        0.20479609 = queryWeight, product of:
          4.4307585 = idf(docFreq=1430, maxDocs=44218)
          0.046221454 = queryNorm
        0.3916274 = fieldWeight in 2908, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.4307585 = idf(docFreq=1430, maxDocs=44218)
          0.0625 = fieldNorm(doc=2908)
    0.04351196 = weight(_text_:web in 2908) [ClassicSimilarity], result of:
      0.04351196 = score(doc=2908,freq=2.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.2884563 = fieldWeight in 2908, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0625 = fieldNorm(doc=2908)
  0.4 = coord(2/5)

Abstract: Webometrics, the quantitative study of Web phenomena, is a field encompassing contributions from information science, computer science, and statistical physics. Its methodology draws especially from bibliometrics. This special issue presents contributions that both push for ward the field and illustrate a wide range of webometric approaches.

Maharana, B.; Nayak, K.; Sahu, N.K.: Scholarly use of web resources in LIS research : a citation analysis (2006) 0.05
```
0.04642074 = product of:
  0.11605185 = sum of:
    0.098052874 = weight(_text_:web in 53) [ClassicSimilarity], result of:
      0.098052874 = score(doc=53,freq=26.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.65002745 = fieldWeight in 53, product of:
          5.0990195 = tf(freq=26.0), with freq of:
            26.0 = termFreq=26.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=53)
    0.017998978 = product of:
      0.035997957 = sum of:
        0.035997957 = weight(_text_:research in 53) [ClassicSimilarity], result of:
          0.035997957 = score(doc=53,freq=6.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.2729826 = fieldWeight in 53, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=53)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

Purpose - The essential purpose of this paper is to measure the amount of web resources used for scholarly contributions in the area of library and information science (LIS) in India. It further aims to make an analysis of the nature and type of web resources and studies the various standards for web citations. Design/methodology/approach - In this study, the result of analysis of 292 web citations spread over 95 scholarly papers published in the proceedings of the National Conference of the Society for Information Science, India (SIS-2005) has been reported. All the 292 web citations were scanned and data relating to types of web domains, file formats, styles of citations, etc., were collected through a structured check list. The data thus obtained were systematically analyzed, figurative representations were made and appropriate interpretations were drawn. Findings - The study revealed that 292 (34.88 per cent) out of 837 were web citations, proving a significant correlation between the use of Internet resources and research productivity of LIS professionals in India. The highest number of web citations (35.6 per cent) was from .edu/.ac type domains. Most of the web resources (46.9 per cent) cited in the study were hypertext markup language (HTML) files. Originality/value - The paper is the result of an original analysis of web citations undertaken in order to study the dependence of LIS professionals in India on web sources for their scholarly contributions. This carries research value for web content providers, authors and researchers in LIS.

Hong, T.: ¬The influence of structural and message features an Web site credibility (2006) 0.04

0.044148784 = product of:
  0.11037196 = sum of:
    0.0979019 = weight(_text_:web in 5787) [ClassicSimilarity], result of:
      0.0979019 = score(doc=5787,freq=18.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.64902663 = fieldWeight in 5787, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=5787)
    0.012470056 = product of:
      0.024940113 = sum of:
        0.024940113 = weight(_text_:research in 5787) [ClassicSimilarity], result of:
          0.024940113 = score(doc=5787,freq=2.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.18912788 = fieldWeight in 5787, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046875 = fieldNorm(doc=5787)
      0.5 = coord(1/2)
  0.4 = coord(2/5)

Abstract: This article explores the associations that message features and Web structural features have with perceptions of Web site credibility. In a within-subjects experiment, 84 participants actively located health-related Web sites an the basis of two tasks that differed in task specificity and complexity. Web sites that were deemed most credible were content analyzed for message features and structural features that have been found to be associated with perceptions of source credibility. Regression analyses indicated that message features predicted perceived Web site credibility for both searches when controlling for Internet experience and issue involvement. Advertisements and structural features had no significant effects an perceived Web site credibility. Institutionaffiliated domain names (.gov, org, edu) predicted Web site credibility, but only in the general search, which was more difficult. Implications of results are discussed in terms of online credibility research and Web site design.

Zhang, Y.; Jansen, B.J.; Spink, A.: Identification of factors predicting clickthrough in Web searching using neural network analysis (2009) 0.04

0.043466296 = product of:
  0.108665735 = sum of:
    0.046151403 = weight(_text_:web in 2742) [ClassicSimilarity], result of:
      0.046151403 = score(doc=2742,freq=4.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.3059541 = fieldWeight in 2742, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=2742)
    0.06251433 = sum of:
      0.024940113 = weight(_text_:research in 2742) [ClassicSimilarity], result of:
        0.024940113 = score(doc=2742,freq=2.0), product of:
          0.13186905 = queryWeight, product of:
            2.8529835 = idf(docFreq=6931, maxDocs=44218)
            0.046221454 = queryNorm
          0.18912788 = fieldWeight in 2742, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.8529835 = idf(docFreq=6931, maxDocs=44218)
            0.046875 = fieldNorm(doc=2742)
      0.037574213 = weight(_text_:22 in 2742) [ClassicSimilarity], result of:
        0.037574213 = score(doc=2742,freq=2.0), product of:
          0.16185966 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046221454 = queryNorm
          0.23214069 = fieldWeight in 2742, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=2742)
  0.4 = coord(2/5)

Abstract: In this research, we aim to identify factors that significantly affect the clickthrough of Web searchers. Our underlying goal is determine more efficient methods to optimize the clickthrough rate. We devise a clickthrough metric for measuring customer satisfaction of search engine results using the number of links visited, number of queries a user submits, and rank of clicked links. We use a neural network to detect the significant influence of searching characteristics on future user clickthrough. Our results show that high occurrences of query reformulation, lengthy searching duration, longer query length, and the higher ranking of prior clicked links correlate positively with future clickthrough. We provide recommendations for leveraging these findings for improving the performance of search engine retrieval and result ranking, along with implications for search engine marketing.
Date: 22. 3.2009 17:49:11

Vaughan, L.; Shaw , D.: Bibliographic and Web citations : what Is the difference? (2003) 0.04
```
0.043377835 = product of:
  0.10844459 = sum of:
    0.098052874 = weight(_text_:web in 5176) [ClassicSimilarity], result of:
      0.098052874 = score(doc=5176,freq=26.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.65002745 = fieldWeight in 5176, product of:
          5.0990195 = tf(freq=26.0), with freq of:
            26.0 = termFreq=26.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5176)
    0.010391714 = product of:
      0.020783428 = sum of:
        0.020783428 = weight(_text_:research in 5176) [ClassicSimilarity], result of:
          0.020783428 = score(doc=5176,freq=2.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.15760657 = fieldWeight in 5176, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5176)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

Vaughn, and Shaw look at the relationship between traditional citation and Web citation (not hyperlinks but rather textual mentions of published papers). Using English language research journals in ISI's 2000 Journal Citation Report - Information and Library Science category - 1209 full length papers published in 1997 in 46 journals were identified. Each was searched in Social Science Citation Index and on the Web using Google phrase search by entering the title in quotation marks, and followed for distinction where necessary with sub-titles, author's names, and journal title words. After removing obvious false drops, the number of web sites was recorded for comparison with the SSCI counts. A second sample from 1992 was also collected for examination. There were a total of 16,371 web citations to the selected papers. The top and bottom ranked four journals were then examined and every third citation to every third paper was selected and classified as to source type, domain, and country of origin. Web counts are much higher than ISI citation counts. Of the 46 journals from 1997, 26 demonstrated a significant correlation between Web and traditional citation counts, and 11 of the 15 in the 1992 sample also showed significant correlation. Journal impact factor in 1998 and 1999 correlated significantly with average Web citations per journal in the 1997 data, but at a low level. Thirty percent of web citations come from other papers posted on the web, and 30percent from listings of web based bibliographic services, while twelve percent come from class reading lists. High web citation journals often have web accessible tables of content.
Thelwall, M.: Conceptualizing documentation on the Web : an evaluation of different heuristic-based models for counting links between university Web sites (2002) 0.04
```
0.04195666 = product of:
  0.10489164 = sum of:
    0.09019554 = weight(_text_:web in 978) [ClassicSimilarity], result of:
      0.09019554 = score(doc=978,freq=22.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.59793836 = fieldWeight in 978, product of:
          4.690416 = tf(freq=22.0), with freq of:
            22.0 = termFreq=22.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=978)
    0.014696103 = product of:
      0.029392205 = sum of:
        0.029392205 = weight(_text_:research in 978) [ClassicSimilarity], result of:
          0.029392205 = score(doc=978,freq=4.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.22288933 = fieldWeight in 978, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=978)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

All known previous Web link studies have used the Web page as the primary indivisible source document for counting purposes. Arguments are presented to explain why this is not necessarily optimal and why other alternatives have the potential to produce better results. This is despite the fact that individual Web files are often the only choice if search engines are used for raw data and are the easiest basic Web unit to identify. The central issue is of defining the Web "document": that which should comprise the single indissoluble unit of coherent material. Three alternative heuristics are defined for the educational arena based upon the directory, the domain and the whole university site. These are then compared by implementing them an a set of 108 UK university institutional Web sites under the assumption that a more effective heuristic will tend to produce results that correlate more highly with institutional research productivity. It was discovered that the domain and directory models were able to successfully reduce the impact of anomalous linking behavior between pairs of Web sites, with the latter being the method of choice. Reasons are then given as to why a document model an its own cannot eliminate all anomalies in Web linking behavior. Finally, the results from all models give a clear confirmation of the very strong association between the research productivity of a UK university and the number of incoming links from its peers' Web sites.
Bar-Ilan, J.: ¬The Web as an information source on informetrics? : A content analysis (2000) 0.04
```
0.041909147 = product of:
  0.104772866 = sum of:
    0.09230281 = weight(_text_:web in 4587) [ClassicSimilarity], result of:
      0.09230281 = score(doc=4587,freq=16.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.6119082 = fieldWeight in 4587, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=4587)
    0.012470056 = product of:
      0.024940113 = sum of:
        0.024940113 = weight(_text_:research in 4587) [ClassicSimilarity], result of:
          0.024940113 = score(doc=4587,freq=2.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.18912788 = fieldWeight in 4587, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046875 = fieldNorm(doc=4587)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

This article addresses the question of whether the Web can serve as an information source for research. Specifically, it analyzes by way of content analysis the Web pages retrieved by the major search engines on a particular date (June 7, 1998), as a result of the query 'informetrics OR informetric'. In 807 out of the 942 retrieved pages, the search terms were mentioned in the context of information science. Over 70% of the pages contained only indirect information on the topic, in the form of hypertext links and bibliographical references without annotation. The bibliographical references extracted from the Web pages were analyzed, and lists of most productive authors, most cited authors, works, and sources were compiled. The list of reference obtained from the Web was also compared to data retrieved from commercial databases. For most cases, the list of references extracted from the Web outperformed the commercial, bibliographic databases. The results of these comparisons indicate that valuable, freely available data is hidden in the Web waiting to be extracted from the millions of Web pages

Payne, N.; Thelwall, M.: Mathematical models for academic webs : linear relationship or non-linear power law? (2005) 0.04

0.03868819 = product of:
  0.09672047 = sum of:
    0.07614593 = weight(_text_:web in 1066) [ClassicSimilarity], result of:
      0.07614593 = score(doc=1066,freq=8.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.50479853 = fieldWeight in 1066, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1066)
    0.020574544 = product of:
      0.041149087 = sum of:
        0.041149087 = weight(_text_:research in 1066) [ClassicSimilarity], result of:
          0.041149087 = score(doc=1066,freq=4.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.31204507 = fieldWeight in 1066, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1066)
      0.5 = coord(1/2)
  0.4 = coord(2/5)

Abstract: Previous studies of academic web interlinking have tended to hypothesise that the relationship between the research of a university and links to or from its web site should follow a linear trend, yet the typical distribution of web data, in general, seems to be a non-linear power law. This paper assesses whether a linear trend or a power law is the most appropriate method with which to model the relationship between research and web site size or outlinks. Following linear regression, analysis of the confidence intervals for the logarithmic graphs, and analysis of the outliers, the results suggest that a linear trend is more appropriate than a non-linear power law.

Vaughan, L.: Visualizing linguistic and cultural differences using Web co-link data (2006) 0.04

0.036962654 = product of:
  0.09240664 = sum of:
    0.07993658 = weight(_text_:web in 184) [ClassicSimilarity], result of:
      0.07993658 = score(doc=184,freq=12.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.5299281 = fieldWeight in 184, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=184)
    0.012470056 = product of:
      0.024940113 = sum of:
        0.024940113 = weight(_text_:research in 184) [ClassicSimilarity], result of:
          0.024940113 = score(doc=184,freq=2.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.18912788 = fieldWeight in 184, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046875 = fieldNorm(doc=184)
      0.5 = coord(1/2)
  0.4 = coord(2/5)

Abstract: The study examined Web co-links to Canadian university Web sites. Multidimensional scaling (MDS) was used to analyze and visualize co-link data as was done in co-citation analysis. Co-link data were collected in ways that would reflect three different views, the global view, the French Canada view, and the English Canada view. Mapping results of the three data sets accurately reflected the ways Canadians see the universities and clearly showed the linguistic and cultural differences within Canadian society. This shows that Web co-linking is not a random phenomenon and that co-link data contain useful information for Web data mining. It is proposed that the method developed in the study can be applied to other contexts such as analyzing relationships of different organizations or countries. This kind of research is promising because of the dynamics and the diversity of the Web.

Thelwall, M.: Extracting macroscopic information from Web links (2001) 0.04
```
0.03682728 = product of:
  0.0920682 = sum of:
    0.06661381 = weight(_text_:web in 6851) [ClassicSimilarity], result of:
      0.06661381 = score(doc=6851,freq=12.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.4416067 = fieldWeight in 6851, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=6851)
    0.025454395 = product of:
      0.05090879 = sum of:
        0.05090879 = weight(_text_:research in 6851) [ClassicSimilarity], result of:
          0.05090879 = score(doc=6851,freq=12.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.38605565 = fieldWeight in 6851, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=6851)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

Much has been written about the potential and pitfalls of macroscopic Web-based link analysis, yet there have been no studies that have provided clear statistical evidence that any of the proposed calculations can produce results over large areas of the Web that correlate with phenomena external to the Internet. This article attempts to provide such evidence through an evaluation of Ingwersen's (1998) proposed external Web Impact Factor (WIF) for the original use of the Web: the interlinking of academic research. In particular, it studies the case of the relationship between academic hyperlinks and research activity for universities in Britain, a country chosen for its variety of institutions and the existence of an official government rating exercise for research. After reviewing the numerous reasons why link counts may be unreliable, it demonstrates that four different WIFs do, in fact, correlate with the conventional academic research measures. The WIF delivering the greatest correlation with research rankings was the ratio of Web pages with links pointing at research-based pages to faculty numbers. The scarcity of links to electronic academic papers in the data set suggests that, in contrast to citation analysis, this WIF is measuring the reputations of universities and their scholars, rather than the quality of their publications
Vaughan, L.; Thelwall, M.: Scholarly use of the Web : what are the key inducers of links to journal Web sites? (2003) 0.04
```
0.036790654 = product of:
  0.091976635 = sum of:
    0.08158492 = weight(_text_:web in 1236) [ClassicSimilarity], result of:
      0.08158492 = score(doc=1236,freq=18.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.5408555 = fieldWeight in 1236, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1236)
    0.010391714 = product of:
      0.020783428 = sum of:
        0.020783428 = weight(_text_:research in 1236) [ClassicSimilarity], result of:
          0.020783428 = score(doc=1236,freq=2.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.15760657 = fieldWeight in 1236, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1236)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

Web links have been studied by information scientists for at least six years but it is only in the past two that clear evidence has emerged to show that counts of links to scholarly Web spaces (universities and departments) can correlate significantly with research measures, giving some credence to their use for the investigation of scholarly communication. This paper reports an a study to investigate the factors that influence the creation of links to journal Web sites. An empirical approach is used: collecting data and testing for significant patterns. The specific questions addressed are whether site age and site content are inducers of links to a journal's Web site as measured by the ratio of link counts to Journal Impact Factors, two variables previously discovered to be related. A new methodology for data collection is also introduced that uses the Internet Archive to obtain an earliest known creation date for Web sites. The results show that both site age and site content are significant factors for the disciplines studied: library and information science, and law. Comparisons between the two fields also show disciplinary differences in Web site characteristics. Scholars and publishers should be particularly aware that richer content an a journal's Web site tends to generate links and thus the traffic to the site.
Thelwall, M.; Vaughan, L.; Björneborn, L.: Webometrics (2004) 0.04
```
0.036790654 = product of:
  0.091976635 = sum of:
    0.08158492 = weight(_text_:web in 4279) [ClassicSimilarity], result of:
      0.08158492 = score(doc=4279,freq=18.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.5408555 = fieldWeight in 4279, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4279)
    0.010391714 = product of:
      0.020783428 = sum of:
        0.020783428 = weight(_text_:research in 4279) [ClassicSimilarity], result of:
          0.020783428 = score(doc=4279,freq=2.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.15760657 = fieldWeight in 4279, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4279)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

Webometrics, the quantitative study of Web-related phenomena, emerged from the realization that methods originally designed for bibliometric analysis of scientific journal article citation patterns could be applied to the Web, with commercial search engines providing the raw data. Almind and Ingwersen (1997) defined the field and gave it its name. Other pioneers included Rodriguez Gairin (1997) and Aguillo (1998). Larson (1996) undertook exploratory link structure analysis, as did Rousseau (1997). Webometrics encompasses research from fields beyond information science such as communication studies, statistical physics, and computer science. In this review we concentrate on link analysis, but also cover other aspects of webometrics, including Web log fle analysis. One theme that runs through this chapter is the messiness of Web data and the need for data cleansing heuristics. The uncontrolled Web creates numerous problems in the interpretation of results, for instance, from the automatic creation or replication of links. The loose connection between top-level domain specifications (e.g., com, edu, and org) and their actual content is also a frustrating problem. For example, many .com sites contain noncommercial content, although com is ostensibly the main commercial top-level domain. Indeed, a skeptical researcher could claim that obstacles of this kind are so great that all Web analyses lack value. As will be seen, one response to this view, a view shared by critics of evaluative bibliometrics, is to demonstrate that Web data correlate significantly with some non-Web data in order to prove that the Web data are not wholly random. A practical response has been to develop increasingly sophisticated data cleansing techniques and multiple data analysis methods.
Jepsen, E.T.; Seiden, P.; Ingwersen, P.; Björneborn, L.; Borlund, P.: Characteristics of scientific Web publications : preliminary data gathering and analysis (2004) 0.03
```
0.034924287 = product of:
  0.08731072 = sum of:
    0.076919004 = weight(_text_:web in 3091) [ClassicSimilarity], result of:
      0.076919004 = score(doc=3091,freq=16.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.5099235 = fieldWeight in 3091, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3091)
    0.010391714 = product of:
      0.020783428 = sum of:
        0.020783428 = weight(_text_:research in 3091) [ClassicSimilarity], result of:
          0.020783428 = score(doc=3091,freq=2.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.15760657 = fieldWeight in 3091, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3091)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

Because of the increasing presence of scientific publications an the Web, combined with the existing difficulties in easily verifying and retrieving these publications, research an techniques and methods for retrieval of scientific Web publications is called for. In this article, we report an the initial steps taken toward the construction of a test collection of scientific Web publications within the subject domain of plant biology. The steps reported are those of data gathering and data analysis aiming at identifying characteristics of scientific Web publications. The data used in this article were generated based an specifically selected domain topics that are searched for in three publicly accessible search engines (Google, AlITheWeb, and AItaVista). A sample of the retrieved hits was analyzed with regard to how various publication attributes correlated with the scientific quality of the content and whether this information could be employed to harvest, filter, and rank Web publications. The attributes analyzed were inlinks, outlinks, bibliographic references, file format, language, search engine overlap, structural position (according to site structure), and the occurrence of various types of metadata. As could be expected, the ranked output differs between the three search engines. Apparently, this is caused by differences in ranking algorithms rather than the databases themselves. In fact, because scientific Web content in this subject domain receives few inlinks, both AItaVista and AlITheWeb retrieved a higher degree of accessible scientific content than Google. Because of the search engine cutoffs of accessible URLs, the feasibility of using search engine output for Web content analysis is also discussed.

Thelwall, M.: ¬A comparison of sources of links for academic Web impact factor calculations (2002) 0.03

0.034746684 = product of:
  0.08686671 = sum of:
    0.065267935 = weight(_text_:web in 4474) [ClassicSimilarity], result of:
      0.065267935 = score(doc=4474,freq=8.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.43268442 = fieldWeight in 4474, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=4474)
    0.021598773 = product of:
      0.043197546 = sum of:
        0.043197546 = weight(_text_:research in 4474) [ClassicSimilarity], result of:
          0.043197546 = score(doc=4474,freq=6.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.3275791 = fieldWeight in 4474, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046875 = fieldNorm(doc=4474)
      0.5 = coord(1/2)
  0.4 = coord(2/5)

Abstract: There has been much recent interest in extracting information from collections of Web links. One tool that has been used is Ingwersen's Web impact factor. It has been demonstrated that several versions of this metric can produce results that correlate with research ratings of British universities showing that, despite being a measure of a purely Internet phenomenon, the results are susceptible to a wider interpretation. This paper addresses the question of which is the best possible domain to count backlinks from, if research is the focus of interest. WIFs for British universities calculated from several different source domains are compared, primarily the .edu, .ac.uk and .uk domains, and the entire Web. The results show that all four areas produce WIFs that correlate strongly with research ratings, but that none produce incontestably superior figures. It was also found that the WIF was less able to differentiate in more homogeneous subsets of universities, although positive results are still possible.

Thelwall, M.: Interpreting social science link analysis research : a theoretical framework (2006) 0.03

0.034746684 = product of:
  0.08686671 = sum of:
    0.065267935 = weight(_text_:web in 4908) [ClassicSimilarity], result of:
      0.065267935 = score(doc=4908,freq=8.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.43268442 = fieldWeight in 4908, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=4908)
    0.021598773 = product of:
      0.043197546 = sum of:
        0.043197546 = weight(_text_:research in 4908) [ClassicSimilarity], result of:
          0.043197546 = score(doc=4908,freq=6.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.3275791 = fieldWeight in 4908, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046875 = fieldNorm(doc=4908)
      0.5 = coord(1/2)
  0.4 = coord(2/5)

Abstract: Link analysis in various forms is now an established technique in many different subjects, reflecting the perceived importance of links and of the Web. A critical but very difficult issue is how to interpret the results of social science link analyses. lt is argued that the dynamic nature of the Web, its lack of quality control, and the online proliferation of copying and imitation mean that methodologies operating within a highly positivist, quantitative framework are ineffective. Conversely, the sheer variety of the Web makes application of qualitative methodologies and pure reason very problematic to large-scale studies. Methodology triangulation is consequently advocated, in combination with a warning that the Web is incapable of giving definitive answers to large-scale link analysis research questions concerning social factors underlying link creation. Finally, it is claimed that although theoretical frameworks are appropriate for guiding research, a Theory of Link Analysis is not possible.

Brody, T.; Harnad, S.; Carr, L.: Earlier Web usage statistics as predictors of later citation impact (2006) 0.03

0.03460754 = product of:
  0.086518854 = sum of:
    0.065944314 = weight(_text_:web in 165) [ClassicSimilarity], result of:
      0.065944314 = score(doc=165,freq=6.0), product of:
        0.1508442 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046221454 = queryNorm
        0.43716836 = fieldWeight in 165, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0546875 = fieldNorm(doc=165)
    0.020574544 = product of:
      0.041149087 = sum of:
        0.041149087 = weight(_text_:research in 165) [ClassicSimilarity], result of:
          0.041149087 = score(doc=165,freq=4.0), product of:
            0.13186905 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046221454 = queryNorm
            0.31204507 = fieldWeight in 165, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0546875 = fieldNorm(doc=165)
      0.5 = coord(1/2)
  0.4 = coord(2/5)

Abstract: The use of citation counts to assess the impact of research articles is well established. However, the citation impact of an article can only be measured several years after it has been published. As research articles are increasingly accessed through the Web, the number of times an article is downloaded can be instantly recorded and counted. One would expect the number of times an article is read to be related both to the number of times it is cited and to how old the article is. The authors analyze how short-term Web usage impact predicts medium-term citation impact. The physics e-print archive-arXiv.org-is used to test this.

Search (43 results, page 1 of 3)

Authors

Languages

Types

Themes

Classifications