Search (2 results, page 1 of 1)

  • author_ss:"Nejdl, W."
  • type_ss:"a"
  • year_i:[2010 TO 2020}
  1. Nejdl, W.; Risse, T.: Herausforderungen für die nationale, regionale und thematische Webarchivierung und deren Nutzung (2015) 0.06
    0.060637787 = product of:
      0.18191336 = sum of:
        0.050318997 = weight(_text_:web in 2531) [ClassicSimilarity], result of:
          0.050318997 = score(doc=2531,freq=8.0), product of:
            0.11629491 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.035634913 = queryNorm
            0.43268442 = fieldWeight in 2531, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.046875 = fieldNorm(doc=2531)
        0.034899916 = weight(_text_:world in 2531) [ClassicSimilarity], result of:
          0.034899916 = score(doc=2531,freq=2.0), product of:
            0.13696888 = queryWeight, product of:
              3.8436708 = idf(docFreq=2573, maxDocs=44218)
              0.035634913 = queryNorm
            0.25480178 = fieldWeight in 2531, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.8436708 = idf(docFreq=2573, maxDocs=44218)
              0.046875 = fieldNorm(doc=2531)
        0.046375446 = weight(_text_:wide in 2531) [ClassicSimilarity], result of:
          0.046375446 = score(doc=2531,freq=2.0), product of:
            0.1578897 = queryWeight, product of:
              4.4307585 = idf(docFreq=1430, maxDocs=44218)
              0.035634913 = queryNorm
            0.29372054 = fieldWeight in 2531, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4307585 = idf(docFreq=1430, maxDocs=44218)
              0.046875 = fieldNorm(doc=2531)
        0.050318997 = weight(_text_:web in 2531) [ClassicSimilarity], result of:
          0.050318997 = score(doc=2531,freq=8.0), product of:
            0.11629491 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.035634913 = queryNorm
            0.43268442 = fieldWeight in 2531, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.046875 = fieldNorm(doc=2531)
      0.33333334 = coord(4/12)
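    The per-clause weights in the explain tree above follow Lucene's ClassicSimilarity (TF-IDF): each clause score is queryWeight × fieldWeight, with tf = sqrt(termFreq) and idf = 1 + ln(maxDocs / (docFreq + 1)), and the document score is the sum of matching clause scores scaled by coord(4/12). A minimal sketch that reproduces the `_text_:web` clause for doc 2531 (function names are mine; `queryNorm` is taken as given from the explain output):

    ```python
    import math

    def classic_idf(doc_freq, max_docs):
        # Lucene ClassicSimilarity: idf = 1 + ln(maxDocs / (docFreq + 1))
        return 1.0 + math.log(max_docs / (doc_freq + 1))

    def clause_score(freq, doc_freq, max_docs, query_norm, field_norm):
        idf = classic_idf(doc_freq, max_docs)
        tf = math.sqrt(freq)                   # tf(freq) = sqrt(termFreq)
        query_weight = idf * query_norm        # "queryWeight" in the explain tree
        field_weight = tf * idf * field_norm   # "fieldWeight" in the explain tree
        return query_weight * field_weight

    # the _text_:web clause for doc 2531: freq=8, docFreq=4597, maxDocs=44218
    web = clause_score(8.0, 4597, 44218,
                       query_norm=0.035634913, field_norm=0.046875)
    # web ≈ 0.050318997; the document score is the clause sum times coord(4/12)
    ```

    The same helper reproduces the `world` and `wide` clauses with their respective docFreq values, and summing all four clauses and multiplying by 4/12 yields the displayed 0.060637787.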
    
    Abstract
    The World Wide Web is established as a worldwide information and communication medium. New technologies regularly extend its forms of use and allow even inexperienced users to publish content or take part in discussions. The web is therefore also regarded as a good documentation of today's society. Because of its dynamic nature, the contents of the web are ephemeral, and new technologies and usage patterns regularly pose new challenges for collecting web content for web archiving. While static pages dominated in the early days of web archiving, today one often deals with dynamically generated content that integrates information from different sources. Beyond classic domain-oriented web harvesting, a growing interest from various research disciplines in thematic web collections and their use and exploration can be observed. This article presents some challenges and approaches for collecting thematic and dynamic content from the web and from social media. It also discusses current problems of scholarly use and shows how web archives and other temporal collections can be searched more effectively.
  2. Souza, J.; Carvalho, A.; Cristo, M.; Moura, E.; Calado, P.; Chirita, P.-A.; Nejdl, W.: Using site-level connections to estimate link confidence (2012) 0.01
    0.012104871 = product of:
      0.07262922 = sum of:
        0.03631461 = weight(_text_:web in 498) [ClassicSimilarity], result of:
          0.03631461 = score(doc=498,freq=6.0), product of:
            0.11629491 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.035634913 = queryNorm
            0.3122631 = fieldWeight in 498, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.0390625 = fieldNorm(doc=498)
        0.03631461 = weight(_text_:web in 498) [ClassicSimilarity], result of:
          0.03631461 = score(doc=498,freq=6.0), product of:
            0.11629491 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.035634913 = queryNorm
            0.3122631 = fieldWeight in 498, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.0390625 = fieldNorm(doc=498)
      0.16666667 = coord(2/12)
    
    Abstract
    Search engines are essential tools for web users today. They rely on a large number of features to compute the rank of search results for each given query. The estimated reputation of pages is among the effective features available for search engine designers, probably being adopted by most current commercial search engines. Page reputation is estimated by analyzing the linkage relationships between pages. This information is used by link analysis algorithms as a query-independent feature, to be taken into account when computing the rank of the results. Unfortunately, several types of links found on the web may damage the estimated page reputation and thus cause a negative effect on the quality of search results. This work studies alternatives to reduce the negative impact of such noisy links. More specifically, the authors propose and evaluate new methods that deal with noisy links, considering scenarios where the reputation of pages is computed using the PageRank algorithm. They show, through experiments with real web content, that their methods achieve significant improvements when compared to previous solutions proposed in the literature.
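    The abstract's setting, page reputation computed with PageRank over the link graph, can be illustrated with a plain power-iteration baseline on a toy graph (this is the textbook algorithm, not the authors' noise-aware variant; graph and names are hypothetical):

    ```python
    def pagerank(links, damping=0.85, iterations=50):
        """Power-iteration PageRank over an adjacency dict {page: [out-links]}.
        A toy query-independent reputation score, as described in the abstract."""
        pages = list(links)
        n = len(pages)
        rank = {p: 1.0 / n for p in pages}
        for _ in range(iterations):
            new = {p: (1.0 - damping) / n for p in pages}
            for p, outs in links.items():
                if not outs:
                    # dangling page: distribute its rank uniformly
                    for q in pages:
                        new[q] += damping * rank[p] / n
                else:
                    for q in outs:
                        new[q] += damping * rank[p] / len(outs)
            rank = new
        return rank

    # toy site graph: "c" collects the most in-links, so it ranks highest;
    # a single noisy (e.g. spam) link into a page would inflate its score,
    # which is the effect the paper's methods try to dampen
    r = pagerank({"a": ["b", "c"], "b": ["c"], "c": ["a"]})
    ```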