Search (144 results, page 1 of 8)

  • × theme_ss:"Data Mining"
  1. Bella, A. La; Fronzetti Colladon, A.; Battistoni, E.; Castellan, S.; Francucci, M.: Assessing perceived organizational leadership styles through twitter text mining (2018) 0.03
    0.033417743 = product of:
      0.08354436 = sum of:
        0.0074119437 = weight(_text_:e in 2400) [ClassicSimilarity], result of:
          0.0074119437 = score(doc=2400,freq=4.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.13475344 = fieldWeight in 2400, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=2400)
        0.07613242 = weight(_text_:69 in 2400) [ClassicSimilarity], result of:
          0.07613242 = score(doc=2400,freq=2.0), product of:
            0.20963728 = queryWeight, product of:
              5.478287 = idf(docFreq=501, maxDocs=44218)
              0.03826694 = queryNorm
            0.36316258 = fieldWeight in 2400, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.478287 = idf(docFreq=501, maxDocs=44218)
              0.046875 = fieldNorm(doc=2400)
      0.4 = coord(2/5)
    
    Language
    e
    Source
    Journal of the Association for Information Science and Technology. 69(2018) no.1, S.21-31
  2. Ebrahimi, M.; ShafieiBavani, E.; Wong, R.; Chen, F.: Twitter user geolocation by filtering of highly mentioned users (2018) 0.03
    0.033417743 = product of:
      0.08354436 = sum of:
        0.0074119437 = weight(_text_:e in 4286) [ClassicSimilarity], result of:
          0.0074119437 = score(doc=4286,freq=4.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.13475344 = fieldWeight in 4286, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=4286)
        0.07613242 = weight(_text_:69 in 4286) [ClassicSimilarity], result of:
          0.07613242 = score(doc=4286,freq=2.0), product of:
            0.20963728 = queryWeight, product of:
              5.478287 = idf(docFreq=501, maxDocs=44218)
              0.03826694 = queryNorm
            0.36316258 = fieldWeight in 4286, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.478287 = idf(docFreq=501, maxDocs=44218)
              0.046875 = fieldNorm(doc=4286)
      0.4 = coord(2/5)
    
    Language
    e
    Source
    Journal of the Association for Information Science and Technology. 69(2018) no.7, S.879-889
  3. Tonkin, E.L.; Tourte, G.J.L.: Working with text. tools, techniques and approaches for text mining (2016) 0.03
    0.027124483 = product of:
      0.067811206 = sum of:
        0.00436753 = weight(_text_:e in 4019) [ClassicSimilarity], result of:
          0.00436753 = score(doc=4019,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.07940422 = fieldWeight in 4019, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4019)
        0.063443676 = weight(_text_:69 in 4019) [ClassicSimilarity], result of:
          0.063443676 = score(doc=4019,freq=2.0), product of:
            0.20963728 = queryWeight, product of:
              5.478287 = idf(docFreq=501, maxDocs=44218)
              0.03826694 = queryNorm
            0.30263546 = fieldWeight in 4019, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.478287 = idf(docFreq=501, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4019)
      0.4 = coord(2/5)
    
    Footnote
    Rez. in: JASIST 69(2018) no.1, S.181-184 (Jacques Savoy).
    Language
    e
  4. Chowdhury, G.G.: Template mining for information extraction from digital documents (1999) 0.02
    0.01940863 = product of:
      0.048521575 = sum of:
        0.012229082 = weight(_text_:e in 4577) [ClassicSimilarity], result of:
          0.012229082 = score(doc=4577,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.2223318 = fieldWeight in 4577, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.109375 = fieldNorm(doc=4577)
        0.036292493 = product of:
          0.07258499 = sum of:
            0.07258499 = weight(_text_:22 in 4577) [ClassicSimilarity], result of:
              0.07258499 = score(doc=4577,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.5416616 = fieldWeight in 4577, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4577)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Date
    2. 4.2000 18:01:22
    Language
    e
  5. Zhang, Z.; Li, Q.; Zeng, D.; Ga, H.: Extracting evolutionary communities in community question answering (2014) 0.02
    0.017561922 = product of:
      0.043904804 = sum of:
        0.00436753 = weight(_text_:e in 1286) [ClassicSimilarity], result of:
          0.00436753 = score(doc=1286,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.07940422 = fieldWeight in 1286, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1286)
        0.039537273 = product of:
          0.11861182 = sum of:
            0.11861182 = weight(_text_:evolution in 1286) [ClassicSimilarity], result of:
              0.11861182 = score(doc=1286,freq=8.0), product of:
                0.2026858 = queryWeight, product of:
                  5.29663 = idf(docFreq=601, maxDocs=44218)
                  0.03826694 = queryNorm
                0.5852004 = fieldWeight in 1286, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  5.29663 = idf(docFreq=601, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1286)
          0.33333334 = coord(1/3)
      0.4 = coord(2/5)
    
    Abstract
    With the rapid growth of Web 2.0, community question answering (CQA) has become a prevalent information seeking channel, in which users form interactive communities by posting questions and providing answers. Communities may evolve over time, because of changes in users' interests, activities, and new users joining the network. To better understand user interactions in CQA communities, it is necessary to analyze the community structures and track community evolution over time. Existing work in CQA focuses on question searching or content quality detection, and the important problems of community extraction and evolutionary pattern detection have not been studied. In this article, we propose a probabilistic community model (PCM) to extract overlapping community structures and capture their evolution patterns in CQA. The empirical results show that our algorithm appears to improve the community extraction quality. We show empirically, using the iPhone data set, that interesting community evolution patterns can be discovered, with each evolution pattern reflecting the variation of users' interests over time. Our analysis suggests that individual users could benefit to gain comprehensive information from tracking the transition of products. We also show that the communities provide a decision-making basis for business.
    Language
    e
  6. KDD : techniques and applications (1998) 0.02
    0.01663597 = product of:
      0.041589923 = sum of:
        0.010482071 = weight(_text_:e in 6783) [ClassicSimilarity], result of:
          0.010482071 = score(doc=6783,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.19057012 = fieldWeight in 6783, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.09375 = fieldNorm(doc=6783)
        0.03110785 = product of:
          0.0622157 = sum of:
            0.0622157 = weight(_text_:22 in 6783) [ClassicSimilarity], result of:
              0.0622157 = score(doc=6783,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.46428138 = fieldWeight in 6783, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=6783)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Footnote
    A special issue of selected papers from the Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'97), held Singapore, 22-23 Feb 1997
    Language
    e
  7. Song, J.; Huang, Y.; Qi, X.; Li, Y.; Li, F.; Fu, K.; Huang, T.: Discovering hierarchical topic evolution in time-stamped documents (2016) 0.02
    0.015515811 = product of:
      0.038789526 = sum of:
        0.0052410355 = weight(_text_:e in 2853) [ClassicSimilarity], result of:
          0.0052410355 = score(doc=2853,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.09528506 = fieldWeight in 2853, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=2853)
        0.03354849 = product of:
          0.10064547 = sum of:
            0.10064547 = weight(_text_:evolution in 2853) [ClassicSimilarity], result of:
              0.10064547 = score(doc=2853,freq=4.0), product of:
                0.2026858 = queryWeight, product of:
                  5.29663 = idf(docFreq=601, maxDocs=44218)
                  0.03826694 = queryNorm
                0.49655905 = fieldWeight in 2853, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  5.29663 = idf(docFreq=601, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2853)
          0.33333334 = coord(1/3)
      0.4 = coord(2/5)
    
    Abstract
    The objective of this paper is to propose a hierarchical topic evolution model (HTEM) that can organize time-varying topics in a hierarchy and discover their evolutions with multiple timescales. In the proposed HTEM, topics near the root of the hierarchy are more abstract and also evolve in the longer timescales than those near the leaves. To achieve this goal, the distance-dependent Chinese restaurant process (ddCRP) is extended to a new nested process that is able to simultaneously model the dependencies among data and the relationship between clusters. The HTEM is proposed based on the new process for time-stamped documents, in which the timestamp is utilized to measure the dependencies among documents. Moreover, an efficient Gibbs sampler is developed for the proposed HTEM. Our experimental results on two popular real-world data sets verify that the proposed HTEM can capture coherent topics and discover their hierarchical evolutions. It also outperforms the baseline model in terms of likelihood on held-out data.
    Language
    e
  8. Teich, E.; Degaetano-Ortlieb, S.; Fankhauser, P.; Kermes, H.; Lapshinova-Koltunski, E.: ¬The linguistic construal of disciplinarity : a data-mining approach using register features (2016) 0.01
    0.013120042 = product of:
      0.032800104 = sum of:
        0.009077741 = weight(_text_:e in 3015) [ClassicSimilarity], result of:
          0.009077741 = score(doc=3015,freq=6.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.16503859 = fieldWeight in 3015, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=3015)
        0.023722364 = product of:
          0.07116709 = sum of:
            0.07116709 = weight(_text_:evolution in 3015) [ClassicSimilarity], result of:
              0.07116709 = score(doc=3015,freq=2.0), product of:
                0.2026858 = queryWeight, product of:
                  5.29663 = idf(docFreq=601, maxDocs=44218)
                  0.03826694 = queryNorm
                0.35112026 = fieldWeight in 3015, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.29663 = idf(docFreq=601, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3015)
          0.33333334 = coord(1/3)
      0.4 = coord(2/5)
    
    Abstract
    We analyze the linguistic evolution of selected scientific disciplines over a 30-year time span (1970s to 2000s). Our focus is on four highly specialized disciplines at the boundaries of computer science that emerged during that time: computational linguistics, bioinformatics, digital construction, and microelectronics. Our analysis is driven by the question whether these disciplines develop a distinctive language use-both individually and collectively-over the given time period. The data set is the English Scientific Text Corpus (scitex), which includes texts from the 1970s/1980s and early 2000s. Our theoretical basis is register theory. In terms of methods, we combine corpus-based methods of feature extraction (various aggregated features [part-of-speech based], n-grams, lexico-grammatical patterns) and automatic text classification. The results of our research are directly relevant to the study of linguistic variation and languages for specific purposes (LSP) and have implications for various natural language processing (NLP) tasks, for example, authorship attribution, text mining, or training NLP tools.
    Language
    e
  9. Matson, L.D.; Bonski, D.J.: Do digital libraries need librarians? (1997) 0.01
    0.011090646 = product of:
      0.027726613 = sum of:
        0.006988047 = weight(_text_:e in 1737) [ClassicSimilarity], result of:
          0.006988047 = score(doc=1737,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.12704675 = fieldWeight in 1737, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0625 = fieldNorm(doc=1737)
        0.020738566 = product of:
          0.041477133 = sum of:
            0.041477133 = weight(_text_:22 in 1737) [ClassicSimilarity], result of:
              0.041477133 = score(doc=1737,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.30952093 = fieldWeight in 1737, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1737)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Date
    22.11.1998 18:57:22
    Language
    e
  10. Amir, A.; Feldman, R.; Kashi, R.: ¬A new and versatile method for association generation (1997) 0.01
    0.011090646 = product of:
      0.027726613 = sum of:
        0.006988047 = weight(_text_:e in 1270) [ClassicSimilarity], result of:
          0.006988047 = score(doc=1270,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.12704675 = fieldWeight in 1270, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0625 = fieldNorm(doc=1270)
        0.020738566 = product of:
          0.041477133 = sum of:
            0.041477133 = weight(_text_:22 in 1270) [ClassicSimilarity], result of:
              0.041477133 = score(doc=1270,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.30952093 = fieldWeight in 1270, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1270)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Language
    e
    Source
    Information systems. 22(1997) nos.5/6, S.333-347
  11. Hofstede, A.H.M. ter; Proper, H.A.; Van der Weide, T.P.: Exploiting fact verbalisation in conceptual information modelling (1997) 0.01
    0.009704315 = product of:
      0.024260787 = sum of:
        0.006114541 = weight(_text_:e in 2908) [ClassicSimilarity], result of:
          0.006114541 = score(doc=2908,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.1111659 = fieldWeight in 2908, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2908)
        0.018146247 = product of:
          0.036292493 = sum of:
            0.036292493 = weight(_text_:22 in 2908) [ClassicSimilarity], result of:
              0.036292493 = score(doc=2908,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.2708308 = fieldWeight in 2908, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2908)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Language
    e
    Source
    Information systems. 22(1997) nos.5/6, S.349-385
  12. Schwartz, F.; Fang, Y.C.: Citation data analysis on hydrogeology (2007) 0.01
    0.0077235745 = product of:
      0.019308936 = sum of:
        0.0034940236 = weight(_text_:e in 433) [ClassicSimilarity], result of:
          0.0034940236 = score(doc=433,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.063523374 = fieldWeight in 433, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03125 = fieldNorm(doc=433)
        0.015814912 = product of:
          0.04744473 = sum of:
            0.04744473 = weight(_text_:evolution in 433) [ClassicSimilarity], result of:
              0.04744473 = score(doc=433,freq=2.0), product of:
                0.2026858 = queryWeight, product of:
                  5.29663 = idf(docFreq=601, maxDocs=44218)
                  0.03826694 = queryNorm
                0.23408018 = fieldWeight in 433, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.29663 = idf(docFreq=601, maxDocs=44218)
                  0.03125 = fieldNorm(doc=433)
          0.33333334 = coord(1/3)
      0.4 = coord(2/5)
    
    Abstract
    This article explores the status of research in hydrogeology using data mining techniques. First we try to explain what citation analysis is and review some of the previous work on citation analysis. The main idea in this article is to address some common issues about citation numbers and the use of these data. To validate the use of citation numbers, we compare the citation patterns for Water Resources Research papers in the 1980s with those in the 1990s. The citation growths for highly cited authors from the 1980s are used to examine whether it is possible to predict the citation patterns for highly-cited authors in the 1990s. If the citation data prove to be steady and stable, these numbers then can be used to explore the evolution of science in hydrogeology. The famous quotation, "If you are not the lead dog, the scenery never changes," attributed to Lee Iacocca, points to the importance of an entrepreneurial spirit in all forms of endeavor. In the case of hydrogeological research, impact analysis makes it clear how important it is to be a pioneer. Statistical correlation coefficients are used to retrieve papers among a collection of 2,847 papers before and after 1991 sharing the same topics with 273 papers in 1991 in Water Resources Research. The numbers of papers before and after 1991 are then plotted against various levels of citations for papers in 1991 to compare the distributions of paper population before and after that year. The similarity metrics based on word counts can ensure that the "before" papers are like ancestors and "after" papers are descendants in the same type of research. This exercise gives us an idea of how many papers are populated before and after 1991 (1991 is chosen based on balanced numbers of papers before and after that year). In addition, the impact of papers is measured in terms of citation presented as "percentile," a relative measure based on rankings in one year, in order to minimize the effect of time.
    Language
    e
  13. Hallonsten, O.; Holmberg, D.: Analyzing structural stratification in the Swedish higher education system : data contextualization with policy-history analysis (2013) 0.01
    0.0069316537 = product of:
      0.017329134 = sum of:
        0.00436753 = weight(_text_:e in 668) [ClassicSimilarity], result of:
          0.00436753 = score(doc=668,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.07940422 = fieldWeight in 668, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0390625 = fieldNorm(doc=668)
        0.012961605 = product of:
          0.02592321 = sum of:
            0.02592321 = weight(_text_:22 in 668) [ClassicSimilarity], result of:
              0.02592321 = score(doc=668,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.19345059 = fieldWeight in 668, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=668)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Date
    22. 3.2013 19:43:01
    Language
    e
  14. Vaughan, L.; Chen, Y.: Data mining from web search queries : a comparison of Google trends and Baidu index (2015) 0.01
    0.0069316537 = product of:
      0.017329134 = sum of:
        0.00436753 = weight(_text_:e in 1605) [ClassicSimilarity], result of:
          0.00436753 = score(doc=1605,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.07940422 = fieldWeight in 1605, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1605)
        0.012961605 = product of:
          0.02592321 = sum of:
            0.02592321 = weight(_text_:22 in 1605) [ClassicSimilarity], result of:
              0.02592321 = score(doc=1605,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.19345059 = fieldWeight in 1605, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1605)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Language
    e
    Source
    Journal of the Association for Information Science and Technology. 66(2015) no.1, S.13-22
  15. Fonseca, F.; Marcinkowski, M.; Davis, C.: Cyber-human systems of thought and understanding (2019) 0.01
    0.0069316537 = product of:
      0.017329134 = sum of:
        0.00436753 = weight(_text_:e in 5011) [ClassicSimilarity], result of:
          0.00436753 = score(doc=5011,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.07940422 = fieldWeight in 5011, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5011)
        0.012961605 = product of:
          0.02592321 = sum of:
            0.02592321 = weight(_text_:22 in 5011) [ClassicSimilarity], result of:
              0.02592321 = score(doc=5011,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.19345059 = fieldWeight in 5011, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5011)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Date
    7. 3.2019 16:32:22
    Language
    e
  16. Jäger, L.: Von Big Data zu Big Brother (2018) 0.01
    0.005545323 = product of:
      0.0138633065 = sum of:
        0.0034940236 = weight(_text_:e in 5234) [ClassicSimilarity], result of:
          0.0034940236 = score(doc=5234,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.063523374 = fieldWeight in 5234, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03125 = fieldNorm(doc=5234)
        0.010369283 = product of:
          0.020738566 = sum of:
            0.020738566 = weight(_text_:22 in 5234) [ClassicSimilarity], result of:
              0.020738566 = score(doc=5234,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.15476047 = fieldWeight in 5234, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=5234)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    1983 bewegte ein einziges Thema die gesamte Bundesrepublik: die geplante Volkszählung. Jeder Haushalt in Westdeutschland sollte Fragebögen mit 36 Fragen zur Wohnsituation, den im Haushalt lebenden Personen und über ihre Einkommensverhältnisse ausfüllen. Es regte sich massiver Widerstand, hunderte Bürgerinitiativen formierten sich im ganzen Land gegen die Befragung. Man wollte nicht "erfasst" werden, die Privatsphäre war heilig. Es bestand die (berechtigte) Sorge, dass die Antworten auf den eigentlich anonymisierten Fragebögen Rückschlüsse auf die Identität der Befragten zulassen. Das Bundesverfassungsgericht gab den Klägern gegen den Zensus Recht: Die geplante Volkszählung verstieß gegen den Datenschutz und damit auch gegen das Grundgesetz. Sie wurde gestoppt. Nur eine Generation später geben wir sorglos jedes Mal beim Einkaufen die Bonuskarte der Supermarktkette heraus, um ein paar Punkte für ein Geschenk oder Rabatte beim nächsten Einkauf zu sammeln. Und dabei wissen wir sehr wohl, dass der Supermarkt damit unser Konsumverhalten bis ins letzte Detail erfährt. Was wir nicht wissen, ist, wer noch Zugang zu diesen Daten erhält. Deren Käufer bekommen nicht nur Zugriff auf unsere Einkäufe, sondern können über sie auch unsere Gewohnheiten, persönlichen Vorlieben und Einkommen ermitteln. Genauso unbeschwert surfen wir im Internet, googeln und shoppen, mailen und chatten. Google, Facebook und Microsoft schauen bei all dem nicht nur zu, sondern speichern auf alle Zeiten alles, was wir von uns geben, was wir einkaufen, was wir suchen, und verwenden es für ihre eigenen Zwecke. Sie durchstöbern unsere E-Mails, kennen unser persönliches Zeitmanagement, verfolgen unseren momentanen Standort, wissen um unsere politischen, religiösen und sexuellen Präferenzen (wer kennt ihn nicht, den Button "an Männern interessiert" oder "an Frauen interessiert"?), unsere engsten Freunde, mit denen wir online verbunden sind, unseren Beziehungsstatus, welche Schule wir besuchen oder besucht haben und vieles mehr.
    Date
    22. 1.2018 11:33:49
  17. Lusti, M.: Data Warehousing and Data Mining : Eine Einführung in entscheidungsunterstützende Systeme (1999) 0.00
    0.0041477135 = product of:
      0.020738566 = sum of:
        0.020738566 = product of:
          0.041477133 = sum of:
            0.041477133 = weight(_text_:22 in 4261) [ClassicSimilarity], result of:
              0.041477133 = score(doc=4261,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.30952093 = fieldWeight in 4261, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4261)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Date
    17. 7.2002 19:22:06
  18. Lackes, R.; Tillmanns, C.: Data Mining für die Unternehmenspraxis : Entscheidungshilfen und Fallstudien mit führenden Softwarelösungen (2006) 0.00
    0.003110785 = product of:
      0.015553925 = sum of:
        0.015553925 = product of:
          0.03110785 = sum of:
            0.03110785 = weight(_text_:22 in 1383) [ClassicSimilarity], result of:
              0.03110785 = score(doc=1383,freq=2.0), product of:
                0.1340043 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03826694 = queryNorm
                0.23214069 = fieldWeight in 1383, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1383)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Date
    22. 3.2008 14:46:06
  19. Howlett, D.: Digging deep for treasure (1998) 0.00
    0.002795219 = product of:
      0.013976094 = sum of:
        0.013976094 = weight(_text_:e in 4544) [ClassicSimilarity], result of:
          0.013976094 = score(doc=4544,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.2540935 = fieldWeight in 4544, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.125 = fieldNorm(doc=4544)
      0.2 = coord(1/5)
    
    Language
    e
  20. Tunbridge, N.: Semiology put to data mining (1999) 0.00
    0.002795219 = product of:
      0.013976094 = sum of:
        0.013976094 = weight(_text_:e in 6782) [ClassicSimilarity], result of:
          0.013976094 = score(doc=6782,freq=2.0), product of:
            0.055003747 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03826694 = queryNorm
            0.2540935 = fieldWeight in 6782, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.125 = fieldNorm(doc=6782)
      0.2 = coord(1/5)
    
    Language
    e

Years

Languages

  • e 134
  • d 10
  • More… Less…

Types