Search (36 results, page 2 of 2)

  • × theme_ss:"Internet"
  • × year_i:[2010 TO 2020}
  1. Maemura, E.; Worby, N.; Milligan, I.; Becker, C.: If these crawls could talk : studying and documenting web archives provenance (2018) 0.01
    0.00804336 = product of:
      0.01608672 = sum of:
        0.01608672 = product of:
          0.03217344 = sum of:
            0.03217344 = weight(_text_:i in 4465) [ClassicSimilarity], result of:
              0.03217344 = score(doc=4465,freq=2.0), product of:
                0.15441231 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.04093939 = queryNorm
                0.20836058 = fieldWeight in 4465, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4465)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  2. Nikolov, D.; Lalmas, M.; Flammini, A.; Menczer, F.: Quantifying biases in online information exposure (2019) 0.01
    0.00804336 = product of:
      0.01608672 = sum of:
        0.01608672 = product of:
          0.03217344 = sum of:
            0.03217344 = weight(_text_:i in 4986) [ClassicSimilarity], result of:
              0.03217344 = score(doc=4986,freq=2.0), product of:
                0.15441231 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.04093939 = queryNorm
                0.20836058 = fieldWeight in 4986, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4986)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Our consumption of online information is mediated by filtering, ranking, and recommendation algorithms that introduce unintentional biases as they attempt to deliver relevant and engaging content. It has been suggested that our reliance on online technologies such as search engines and social media may limit exposure to diverse points of view and make us vulnerable to manipulation by disinformation. In this article, we mine a massive data set of web traffic to quantify two kinds of bias: (i) homogeneity bias, which is the tendency to consume content from a narrow set of information sources, and (ii) popularity bias, which is the selective exposure to content from top sites. Our analysis reveals different bias levels across several widely used web platforms. Search exposes users to a diverse set of sources, while social media traffic tends to exhibit high popularity and homogeneity bias. When we focus our analysis on traffic to news sites, we find higher levels of popularity bias, with smaller differences across applications. Overall, our results quantify the extent to which our choices of online systems confine us inside "social bubbles."
  3. Pluye, P.; El Sherif, R.; Granikov, V.; Hong, Q.N.; Vedel, I.; Barbosa Galvao, M.C.; Frati, F.E.Y.; Desroches, S.; Repchinsky, C.; Rihoux, B.; Légaré, F.; Burnand, B.; Bujold, M.; Grad, R.: Health outcomes of online consumer health information : a systematic mixed studies review with framework synthesis (2019) 0.01
    0.00804336 = product of:
      0.01608672 = sum of:
        0.01608672 = product of:
          0.03217344 = sum of:
            0.03217344 = weight(_text_:i in 5292) [ClassicSimilarity], result of:
              0.03217344 = score(doc=5292,freq=2.0), product of:
                0.15441231 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.04093939 = queryNorm
                0.20836058 = fieldWeight in 5292, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5292)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  4. Dufour, C.; Bartlett, J.C.; Toms, E.G.: Understanding how webcasts are used as sources of information (2011) 0.01
    0.006933403 = product of:
      0.013866806 = sum of:
        0.013866806 = product of:
          0.027733613 = sum of:
            0.027733613 = weight(_text_:22 in 4195) [ClassicSimilarity], result of:
              0.027733613 = score(doc=4195,freq=2.0), product of:
                0.14336278 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04093939 = queryNorm
                0.19345059 = fieldWeight in 4195, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4195)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 1.2011 14:16:14
  5. Zimmer, M.; Proferes, N.J.: ¬A topology of Twitter research : disciplines, methods, and ethics (2014) 0.01
    0.006933403 = product of:
      0.013866806 = sum of:
        0.013866806 = product of:
          0.027733613 = sum of:
            0.027733613 = weight(_text_:22 in 1622) [ClassicSimilarity], result of:
              0.027733613 = score(doc=1622,freq=2.0), product of:
                0.14336278 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04093939 = queryNorm
                0.19345059 = fieldWeight in 1622, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1622)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    20. 1.2015 18:30:22
  6. Bhattacharya, S.; Yang, C.; Srinivasan, P.; Boynton, B.: Perceptions of presidential candidates' personalities in twitter (2016) 0.01
    0.006933403 = product of:
      0.013866806 = sum of:
        0.013866806 = product of:
          0.027733613 = sum of:
            0.027733613 = weight(_text_:22 in 2635) [ClassicSimilarity], result of:
              0.027733613 = score(doc=2635,freq=2.0), product of:
                0.14336278 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04093939 = queryNorm
                0.19345059 = fieldWeight in 2635, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2635)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 1.2016 11:25:47
  7. Bhatia, S.; Biyani, P.; Mitra, P.: Identifying the role of individual user messages in an online discussion and its use in thread retrieval (2016) 0.01
    0.006933403 = product of:
      0.013866806 = sum of:
        0.013866806 = product of:
          0.027733613 = sum of:
            0.027733613 = weight(_text_:22 in 2650) [ClassicSimilarity], result of:
              0.027733613 = score(doc=2650,freq=2.0), product of:
                0.14336278 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04093939 = queryNorm
                0.19345059 = fieldWeight in 2650, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2650)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 1.2016 11:50:46
  8. Dalip, D.H.; Gonçalves, M.A.; Cristo, M.; Calado, P.: ¬A general multiview framework for assessing the quality of collaboratively created content on web 2.0 (2017) 0.01
    0.006933403 = product of:
      0.013866806 = sum of:
        0.013866806 = product of:
          0.027733613 = sum of:
            0.027733613 = weight(_text_:22 in 3343) [ClassicSimilarity], result of:
              0.027733613 = score(doc=3343,freq=2.0), product of:
                0.14336278 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04093939 = queryNorm
                0.19345059 = fieldWeight in 3343, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3343)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    16.11.2017 13:04:22
  9. Bünte, O.: Bundesdatenschutzbeauftragte bezweifelt Facebooks Datenschutzversprechen (2018) 0.01
    0.006933403 = product of:
      0.013866806 = sum of:
        0.013866806 = product of:
          0.027733613 = sum of:
            0.027733613 = weight(_text_:22 in 4180) [ClassicSimilarity], result of:
              0.027733613 = score(doc=4180,freq=2.0), product of:
                0.14336278 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04093939 = queryNorm
                0.19345059 = fieldWeight in 4180, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4180)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    23. 3.2018 13:41:22
  10. Barrio, P.; Gravano, L.: Sampling strategies for information extraction over the deep web (2017) 0.01
    0.0064346883 = product of:
      0.012869377 = sum of:
        0.012869377 = product of:
          0.025738753 = sum of:
            0.025738753 = weight(_text_:i in 3412) [ClassicSimilarity], result of:
              0.025738753 = score(doc=3412,freq=2.0), product of:
                0.15441231 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.04093939 = queryNorm
                0.16668847 = fieldWeight in 3412, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.03125 = fieldNorm(doc=3412)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Information extraction systems discover structured information in natural language text. Having information in structured form enables much richer querying and data mining than possible over the natural language text. However, information extraction is a computationally expensive task, and hence improving the efficiency of the extraction process over large text collections is of critical interest. In this paper, we focus on an especially valuable family of text collections, namely, the so-called deep-web text collections, whose contents are not crawlable and are only available via querying. Important steps for efficient information extraction over deep-web text collections (e.g., selecting the collections on which to focus the extraction effort, based on their contents; or learning which documents within these collections-and in which order-to process, based on their words and phrases) require having a representative document sample from each collection. These document samples have to be collected by querying the deep-web text collections, an expensive process that renders impractical the existing sampling approaches developed for other data scenarios. In this paper, we systematically study the space of query-based document sampling techniques for information extraction over the deep web. Specifically, we consider (i) alternative query execution schedules, which vary on how they account for the query effectiveness, and (ii) alternative document retrieval and processing schedules, which vary on how they distribute the extraction effort over documents. We report the results of the first large-scale experimental evaluation of sampling techniques for information extraction over the deep web. Our results show the merits and limitations of the alternative query execution and document retrieval and processing strategies, and provide a roadmap for addressing this critically important building block for efficient, scalable information extraction.
  11. Willkommen in der Datenwelt! : Haben wir unser digitales Leben noch im Griff? (2017) 0.01
    0.0064346883 = product of:
      0.012869377 = sum of:
        0.012869377 = product of:
          0.025738753 = sum of:
            0.025738753 = weight(_text_:i in 3420) [ClassicSimilarity], result of:
              0.025738753 = score(doc=3420,freq=2.0), product of:
                0.15441231 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.04093939 = queryNorm
                0.16668847 = fieldWeight in 3420, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.03125 = fieldNorm(doc=3420)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Content
    Digital-Manifest (I): Digitale Demokratie statt Datendiktatur: Es droht die Automatisierung der Gesellschaft. (S.6) - Digital-Manifest (II): Eine Strategie für das digitale Zeitalter: Wie können wir unsere Demokratie schützen? Digital-Manifest: Expertenkommentare(S.15) - Die Debatte über mögliche Gefahren geht weiter. Essay: Was ist uns die Privatsphäre wert? (S.18) - Wer uneingeschränkt persönliche Informationen preisgibt, liefert sich aus. Datenschutz: Die Folgen der digitalen Transparenz (S.20) - Neuen Organisationsformen gehört die Zukunft. Epidemiologie: Eine Diagnose der Menschheit (S.28) - Mit einem Supercomputer stellen Forscher den globalen Gesundheitszustand fest. Maschinelles Lernen: Intelligenz bei Mensch und Computer (S.32) - Im Gegensatz zu uns lösen Algorithmen meist nur sehr spezielle Probleme - zumindest bislang. Ethik: Wir müssen autonome Killerroboter verbieten (S.40) - Maschinen überblicken nicht die Folgen ihres Tuns. Big und Smart Data: Zweckbindung zwecklos? (S.48) - Die Weiterverwendbarkeit von Daten stößt an Grenzen. Datensparsamkeit: Zukunftsfähig statt überholt (S.56) - Die Weiterverwendbarkeit von Daten stößt an Grenzen. Informationelle Selbstbestimmung: Ein erstrebenswertes Ziel (S.62) - Persönliche Entfaltung muss möglich bleiben. Digital Dienste: Zwischen Innovation und Sicherheit (S.68) - Schutz- und Kontrollbedarf fördert das Nutzervertrauen. Privatsphäre: Interview "Jede Äußerung wird konserviert" (S.74) - Sammelwut bedroht uns alle, meint Markus Beckedahl. (S.78)
  12. Kaeser, E.: ¬Das postfaktische Zeitalter (2016) 0.01
    0.005883188 = product of:
      0.011766376 = sum of:
        0.011766376 = product of:
          0.023532752 = sum of:
            0.023532752 = weight(_text_:22 in 3080) [ClassicSimilarity], result of:
              0.023532752 = score(doc=3080,freq=4.0), product of:
                0.14336278 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04093939 = queryNorm
                0.16414827 = fieldWeight in 3080, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0234375 = fieldNorm(doc=3080)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Content
    "Es gibt Daten, Informationen und Fakten. Wenn man mir eine Zahlenreihe vorsetzt, dann handelt es sich um Daten: unterscheidbare Einheiten, im Fachjargon: Items. Wenn man mir sagt, dass diese Items stündliche Temperaturangaben der Aare im Berner Marzilibad bedeuten, dann verfüge ich über Information - über interpretierte Daten. Wenn man mir sagt, dies seien die gemessenen Aaretemperaturen am 22. August 2016 im Marzili, dann ist das ein Faktum: empirisch geprüfte interpretierte Daten. Dieser Dreischritt - Unterscheiden, Interpretieren, Prüfen - bildet quasi das Bindemittel des Faktischen, «the matter of fact». Wir alle führen den Dreischritt ständig aus und gelangen so zu einem relativ verlässlichen Wissen und Urteilsvermögen betreffend die Dinge des Alltags. Aber wie schon die Kurzcharakterisierung durchblicken lässt, bilden Fakten nicht den Felsengrund der Realität. Sie sind kritikanfällig, sowohl von der Interpretation wie auch von der Prüfung her gesehen. Um bei unserem Beispiel zu bleiben: Es kann durchaus sein, dass man uns zwei unterschiedliche «faktische» Temperaturverläufe der Aare am 22. August 2016 vorsetzt.
  13. Joint, N.: Web 2.0 and the library : a transformational technology? (2010) 0.01
    0.0055467226 = product of:
      0.011093445 = sum of:
        0.011093445 = product of:
          0.02218689 = sum of:
            0.02218689 = weight(_text_:22 in 4202) [ClassicSimilarity], result of:
              0.02218689 = score(doc=4202,freq=2.0), product of:
                0.14336278 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04093939 = queryNorm
                0.15476047 = fieldWeight in 4202, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=4202)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 1.2011 17:54:04
  14. Jamali, H.R.; Shahbaztabar, P.: ¬The effects of internet filtering on users' information-seeking behaviour and emotions (2017) 0.01
    0.0055467226 = product of:
      0.011093445 = sum of:
        0.011093445 = product of:
          0.02218689 = sum of:
            0.02218689 = weight(_text_:22 in 4052) [ClassicSimilarity], result of:
              0.02218689 = score(doc=4052,freq=2.0), product of:
                0.14336278 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04093939 = queryNorm
                0.15476047 = fieldWeight in 4052, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=4052)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    20. 1.2015 18:30:22
  15. Griesbaum, J.; Mahrholz, N.; Kiedrowski, K. von Löwe; Rittberger, M.: Knowledge generation in online forums : a case study in the German educational domain (2015) 0.01
    0.0055467226 = product of:
      0.011093445 = sum of:
        0.011093445 = product of:
          0.02218689 = sum of:
            0.02218689 = weight(_text_:22 in 4440) [ClassicSimilarity], result of:
              0.02218689 = score(doc=4440,freq=2.0), product of:
                0.14336278 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04093939 = queryNorm
                0.15476047 = fieldWeight in 4440, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=4440)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    20. 1.2015 18:30:22
  16. Höhn, S.: Stalins Badezimmer in Wikipedia : Die Macher der Internet-Enzyklopädie diskutieren über Verantwortung und Transparenz. Der Brockhaus kehrt dagegen zur gedruckten Ausgabe zurück. (2012) 0.00
    0.004902656 = product of:
      0.009805312 = sum of:
        0.009805312 = product of:
          0.019610625 = sum of:
            0.019610625 = weight(_text_:22 in 2171) [ClassicSimilarity], result of:
              0.019610625 = score(doc=2171,freq=4.0), product of:
                0.14336278 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04093939 = queryNorm
                0.13679022 = fieldWeight in 2171, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.01953125 = fieldNorm(doc=2171)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Content
    Der neue Herausgeber des Brockhaus, ein Tochterverlag von Bertelsmann, hat unterdessen angekündigt, zum gedruckten Lexikon zurückzukehren. Etwa Anfang 2015 soll die 22. Auflage erscheinen. In Zeiten des virtuellen Informationsoverkills gebe es einen Bedarf an Orientierung, an Relevanzvorgaben, sagt Geschäftsführer Christoph Hünermann. Ausgerechnet Bertelsmann druckte 2008 ein knapp 1 000 Seiten langes Wikipedia-Lexikon mit den 50 000 meist gesuchten Begriffen. Eine Experten-Redaktion überprüfte die Einträge sicherheitshalber zuvor - soll allerdings kaum Fehler gefunden haben."
    Source
    Frankfurter Rundschau. Nr.76 vom 29.3.2012, S.22-23

Languages

  • e 26
  • d 9

Types

  • a 32
  • el 5
  • s 2
  • m 1
  • More… Less…