Search (11 results, page 1 of 1)

Zhang, Y.; Jansen, B.J.; Spink, A.: Identification of factors predicting clickthrough in Web searching using neural network analysis (2009) 0.03

0.03458574 = product of:
  0.05187861 = sum of:
    0.031063346 = weight(_text_:retrieval in 2742) [ClassicSimilarity], result of:
      0.031063346 = score(doc=2742,freq=2.0), product of:
        0.15490976 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.051211275 = queryNorm
        0.20052543 = fieldWeight in 2742, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=2742)
    0.020815263 = product of:
      0.041630525 = sum of:
        0.041630525 = weight(_text_:22 in 2742) [ClassicSimilarity], result of:
          0.041630525 = score(doc=2742,freq=2.0), product of:
            0.17933317 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.051211275 = queryNorm
            0.23214069 = fieldWeight in 2742, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2742)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: In this research, we aim to identify factors that significantly affect the clickthrough of Web searchers. Our underlying goal is determine more efficient methods to optimize the clickthrough rate. We devise a clickthrough metric for measuring customer satisfaction of search engine results using the number of links visited, number of queries a user submits, and rank of clicked links. We use a neural network to detect the significant influence of searching characteristics on future user clickthrough. Our results show that high occurrences of query reformulation, lengthy searching duration, longer query length, and the higher ranking of prior clicked links correlate positively with future clickthrough. We provide recommendations for leveraging these findings for improving the performance of search engine retrieval and result ranking, along with implications for search engine marketing.
Date: 22. 3.2009 17:49:11

Thelwall, M.; Wilkinson, D.: Finding similar academic Web sites with links, bibliometric couplings and colinks (2004) 0.01
```
0.014643403 = product of:
  0.043930206 = sum of:
    0.043930206 = weight(_text_:retrieval in 2571) [ClassicSimilarity], result of:
      0.043930206 = score(doc=2571,freq=4.0), product of:
        0.15490976 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.051211275 = queryNorm
        0.2835858 = fieldWeight in 2571, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=2571)
  0.33333334 = coord(1/3)
```
Abstract

A common task in both Webmetrics and Web information retrieval is to identify a set of Web pages or sites that are similar in content. In this paper we assess the extent to which links, colinks and couplings can be used to identify similar Web sites. As an experiment, a random sample of 500 pairs of domains from the UK academic Web were taken and human assessments of site similarity, based upon content type, were compared against ratings for the three concepts. The results show that using a combination of all three gives the highest probability of identifying similar sites, but surprisingly this was only a marginal improvement over using links alone. Another unexpected result was that high values for either colink counts or couplings were associated with only a small increased likelihood of similarity. The principal advantage of using couplings and colinks was found to be greater coverage in terms of a much larger number of pairs of sites being connected by these measures, instead of increased probability of similarity. In information retrieval terminology, this is improved recall rather than improved precision.
Thelwall, M.; Sud, P.: ¬A comparison of methods for collecting web citation data for academic organizations (2011) 0.01
```
0.012202835 = product of:
  0.036608502 = sum of:
    0.036608502 = weight(_text_:retrieval in 4626) [ClassicSimilarity], result of:
      0.036608502 = score(doc=4626,freq=4.0), product of:
        0.15490976 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.051211275 = queryNorm
        0.23632148 = fieldWeight in 4626, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4626)
  0.33333334 = coord(1/3)
```
Abstract

The primary webometric method for estimating the online impact of an organization is to count links to its website. Link counts have been available from commercial search engines for over a decade but this was set to end by early 2012 and so a replacement is needed. This article compares link counts to two alternative methods: URL citations and organization title mentions. New variations of these methods are also introduced. The three methods are compared against each other using Yahoo!. Two of the three methods (URL citations and organization title mentions) are also compared against each other using Bing. Evidence from a case study of 131 UK universities and 49 US Library and Information Science (LIS) departments suggests that Bing's Hit Count Estimates (HCEs) for popular title searches are not useful for webometric research but that Yahoo!'s HCEs for all three types of search and Bing's URL citation HCEs seem to be consistent. For exact URL counts the results of all three methods in Yahoo! and both methods in Bing are also consistent. Four types of accuracy factors are also introduced and defined: search engine coverage, search engine retrieval variation, search engine retrieval anomalies, and query polysemy.

Menczer, F.: Lexical and semantic clustering by Web links (2004) 0.01

0.010354449 = product of:
  0.031063346 = sum of:
    0.031063346 = weight(_text_:retrieval in 3090) [ClassicSimilarity], result of:
      0.031063346 = score(doc=3090,freq=2.0), product of:
        0.15490976 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.051211275 = queryNorm
        0.20052543 = fieldWeight in 3090, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=3090)
  0.33333334 = coord(1/3)

Theme: Semantisches Umfeld in Indexierung u. Retrieval

Jepsen, E.T.; Seiden, P.; Ingwersen, P.; Björneborn, L.; Borlund, P.: Characteristics of scientific Web publications : preliminary data gathering and analysis (2004) 0.01
```
0.008628707 = product of:
  0.025886122 = sum of:
    0.025886122 = weight(_text_:retrieval in 3091) [ClassicSimilarity], result of:
      0.025886122 = score(doc=3091,freq=2.0), product of:
        0.15490976 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.051211275 = queryNorm
        0.16710453 = fieldWeight in 3091, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3091)
  0.33333334 = coord(1/3)
```
Abstract

Because of the increasing presence of scientific publications an the Web, combined with the existing difficulties in easily verifying and retrieving these publications, research an techniques and methods for retrieval of scientific Web publications is called for. In this article, we report an the initial steps taken toward the construction of a test collection of scientific Web publications within the subject domain of plant biology. The steps reported are those of data gathering and data analysis aiming at identifying characteristics of scientific Web publications. The data used in this article were generated based an specifically selected domain topics that are searched for in three publicly accessible search engines (Google, AlITheWeb, and AItaVista). A sample of the retrieved hits was analyzed with regard to how various publication attributes correlated with the scientific quality of the content and whether this information could be employed to harvest, filter, and rank Web publications. The attributes analyzed were inlinks, outlinks, bibliographic references, file format, language, search engine overlap, structural position (according to site structure), and the occurrence of various types of metadata. As could be expected, the ranked output differs between the three search engines. Apparently, this is caused by differences in ranking algorithms rather than the databases themselves. In fact, because scientific Web content in this subject domain receives few inlinks, both AItaVista and AlITheWeb retrieved a higher degree of accessible scientific content than Google. Because of the search engine cutoffs of accessible URLs, the feasibility of using search engine output for Web content analysis is also discussed.
Amitay, E.; Carmel, D.; Herscovici, M.; Lempel, R.; Soffer, A.: Trend detection through temporal link analysis (2004) 0.01
```
0.008628707 = product of:
  0.025886122 = sum of:
    0.025886122 = weight(_text_:retrieval in 3092) [ClassicSimilarity], result of:
      0.025886122 = score(doc=3092,freq=2.0), product of:
        0.15490976 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.051211275 = queryNorm
        0.16710453 = fieldWeight in 3092, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3092)
  0.33333334 = coord(1/3)
```
Abstract

Although time has been recognized as an important dimension in the co-citation literature, to date it has not been incorporated into the analogous process of link analysis an the Web. In this paper, we discuss several aspects and uses of the time dimension in the context of Web information retrieval. We describe the ideal casewhere search engines track and store temporal data for each of the pages in their repository, assigning timestamps to the hyperlinks embedded within the pages. We introduce several applications which benefit from the availability of such timestamps. To demonstrate our claims, we use a somewhat simplistic approach, which dates links by approximating the age of the page's content. We show that by using this crude measure alone it is possible to detect and expose significant events and trends. We predict that by using more robust methods for tracking modifications in the content of pages, search engines will be able to provide results that are more timely and better reflect current real-life trends than those they provide today.

Zhang, Y.: ¬The impact of Internet-based electronic resources on formal scholarly communication in the area of library and information science : a citation analysis (1998) 0.01

0.008177008 = product of:
  0.024531022 = sum of:
    0.024531022 = product of:
      0.049062043 = sum of:
        0.049062043 = weight(_text_:22 in 2808) [ClassicSimilarity], result of:
          0.049062043 = score(doc=2808,freq=4.0), product of:
            0.17933317 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.051211275 = queryNorm
            0.27358043 = fieldWeight in 2808, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2808)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 30. 1.1999 17:22:22

Neth, M.: Citation analysis and the Web (1998) 0.01

0.008094825 = product of:
  0.024284473 = sum of:
    0.024284473 = product of:
      0.048568945 = sum of:
        0.048568945 = weight(_text_:22 in 108) [ClassicSimilarity], result of:
          0.048568945 = score(doc=108,freq=2.0), product of:
            0.17933317 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.051211275 = queryNorm
            0.2708308 = fieldWeight in 108, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=108)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 10. 1.1999 16:22:37

Tonta, Y.: Scholarly communication and the use of networked information sources (1996) 0.01

0.006938421 = product of:
  0.020815263 = sum of:
    0.020815263 = product of:
      0.041630525 = sum of:
        0.041630525 = weight(_text_:22 in 6389) [ClassicSimilarity], result of:
          0.041630525 = score(doc=6389,freq=2.0), product of:
            0.17933317 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.051211275 = queryNorm
            0.23214069 = fieldWeight in 6389, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=6389)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: IFLA journal. 22(1996) no.3, S.240-245

Raan, A.F.J. van; Noyons, E.C.M.: Discovery of patterns of scientific and technological development and knowledge transfer (2002) 0.01

0.006779279 = product of:
  0.020337837 = sum of:
    0.020337837 = product of:
      0.040675674 = sum of:
        0.040675674 = weight(_text_:conference in 3603) [ClassicSimilarity], result of:
          0.040675674 = score(doc=3603,freq=2.0), product of:
            0.19418365 = queryWeight, product of:
              3.7918143 = idf(docFreq=2710, maxDocs=44218)
              0.051211275 = queryNorm
            0.20947012 = fieldWeight in 3603, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7918143 = idf(docFreq=2710, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3603)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Gaining insight from research information (CRIS2002): Proceedings of the 6th International Conference an Current Research Information Systems, University of Kassel, August 29 - 31, 2002. Eds: W. Adamczak u. A. Nase

Maharana, B.; Nayak, K.; Sahu, N.K.: Scholarly use of web resources in LIS research : a citation analysis (2006) 0.01
```
0.006779279 = product of:
  0.020337837 = sum of:
    0.020337837 = product of:
      0.040675674 = sum of:
        0.040675674 = weight(_text_:conference in 53) [ClassicSimilarity], result of:
          0.040675674 = score(doc=53,freq=2.0), product of:
            0.19418365 = queryWeight, product of:
              3.7918143 = idf(docFreq=2710, maxDocs=44218)
              0.051211275 = queryNorm
            0.20947012 = fieldWeight in 53, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7918143 = idf(docFreq=2710, maxDocs=44218)
              0.0390625 = fieldNorm(doc=53)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)
```
Abstract

Purpose - The essential purpose of this paper is to measure the amount of web resources used for scholarly contributions in the area of library and information science (LIS) in India. It further aims to make an analysis of the nature and type of web resources and studies the various standards for web citations. Design/methodology/approach - In this study, the result of analysis of 292 web citations spread over 95 scholarly papers published in the proceedings of the National Conference of the Society for Information Science, India (SIS-2005) has been reported. All the 292 web citations were scanned and data relating to types of web domains, file formats, styles of citations, etc., were collected through a structured check list. The data thus obtained were systematically analyzed, figurative representations were made and appropriate interpretations were drawn. Findings - The study revealed that 292 (34.88 per cent) out of 837 were web citations, proving a significant correlation between the use of Internet resources and research productivity of LIS professionals in India. The highest number of web citations (35.6 per cent) was from .edu/.ac type domains. Most of the web resources (46.9 per cent) cited in the study were hypertext markup language (HTML) files. Originality/value - The paper is the result of an original analysis of web citations undertaken in order to study the dependence of LIS professionals in India on web sources for their scholarly contributions. This carries research value for web content providers, authors and researchers in LIS.

Search (11 results, page 1 of 1)

Authors

Years

Themes