Search (50 results, page 1 of 3)

  • × theme_ss:"Data Mining"
  • × year_i:[2010 TO 2020}
  1. Ma, Z.; Sun, A.; Cong, G.: On predicting the popularity of newly emerging hashtags in Twitter (2013) 0.02
    0.017230602 = product of:
      0.034461204 = sum of:
        0.0037603125 = weight(_text_:e in 967) [ClassicSimilarity], result of:
          0.0037603125 = score(doc=967,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.07940422 = fieldWeight in 967, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0390625 = fieldNorm(doc=967)
        0.023193657 = weight(_text_:k in 967) [ClassicSimilarity], result of:
          0.023193657 = score(doc=967,freq=2.0), product of:
            0.11761237 = queryWeight, product of:
              3.569778 = idf(docFreq=3384, maxDocs=44218)
              0.03294669 = queryNorm
            0.19720423 = fieldWeight in 967, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.569778 = idf(docFreq=3384, maxDocs=44218)
              0.0390625 = fieldNorm(doc=967)
        0.007507236 = product of:
          0.022521708 = sum of:
            0.022521708 = weight(_text_:29 in 967) [ClassicSimilarity], result of:
              0.022521708 = score(doc=967,freq=2.0), product of:
                0.11589616 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03294669 = queryNorm
                0.19432661 = fieldWeight in 967, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=967)
          0.33333334 = coord(1/3)
      0.5 = coord(3/6)
    
    Abstract
    Because of Twitter's popularity and the viral nature of information dissemination on Twitter, predicting which Twitter topics will become popular in the near future becomes a task of considerable economic importance. Many Twitter topics are annotated by hashtags. In this article, we propose methods to predict the popularity of new hashtags on Twitter by formulating the problem as a classification task. We use five standard classification models (i.e., naïve Bayes, k-nearest neighbors, decision trees, support vector machines, and logistic regression) for prediction. The main challenge is the identification of effective features for describing new hashtags. We extract 7 content features from a hashtag string and the collection of tweets containing the hashtag and 11 contextual features from the social graph formed by users who have adopted the hashtag. We conducted experiments on a Twitter data set consisting of 31 million tweets from 2 million Singapore-based users. The experimental results show that the standard classifiers using the extracted features significantly outperform the baseline methods that do not use these features. Among the five classifiers, the logistic regression model performs the best in terms of the Micro-F1 measure. We also observe that contextual features are more effective than content features.
    Date
    25. 6.2013 19:05:29
    Language
    e
  2. Chardonnens, A.; Hengchen, S.: Text mining for cultural heritage institutions : a 5-step method for cultural heritage institutions (2017) 0.01
    0.012413344 = product of:
      0.037240032 = sum of:
        0.0060165 = weight(_text_:e in 646) [ClassicSimilarity], result of:
          0.0060165 = score(doc=646,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.12704675 = fieldWeight in 646, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0625 = fieldNorm(doc=646)
        0.031223532 = weight(_text_:u in 646) [ClassicSimilarity], result of:
          0.031223532 = score(doc=646,freq=2.0), product of:
            0.107882105 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.03294669 = queryNorm
            0.28942272 = fieldWeight in 646, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.0625 = fieldNorm(doc=646)
      0.33333334 = coord(2/6)
    
    Language
    e
    Source
    Everything changes, everything stays the same? - Understanding information spaces : Proceedings of the 15th International Symposium of Information Science (ISI 2017), Berlin/Germany, 13th - 15th March 2017. Eds.: M. Gäde, V. Trkulja u. V. Petras
  3. Huvila, I.: Mining qualitative data on human information behaviour from the Web (2010) 0.01
    0.010861676 = product of:
      0.03258503 = sum of:
        0.0052644373 = weight(_text_:e in 4676) [ClassicSimilarity], result of:
          0.0052644373 = score(doc=4676,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.1111659 = fieldWeight in 4676, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4676)
        0.02732059 = weight(_text_:u in 4676) [ClassicSimilarity], result of:
          0.02732059 = score(doc=4676,freq=2.0), product of:
            0.107882105 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.03294669 = queryNorm
            0.25324488 = fieldWeight in 4676, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4676)
      0.33333334 = coord(2/6)
    
    Language
    e
    Source
    Information und Wissen: global, sozial und frei? Proceedings des 12. Internationalen Symposiums für Informationswissenschaft (ISI 2011) ; Hildesheim, 9. - 11. März 2011. Hrsg.: J. Griesbaum, T. Mandl u. C. Womser-Hacker
  4. Song, J.; Huang, Y.; Qi, X.; Li, Y.; Li, F.; Fu, K.; Huang, T.: Discovering hierarchical topic evolution in time-stamped documents (2016) 0.01
    0.010781588 = product of:
      0.032344762 = sum of:
        0.0045123748 = weight(_text_:e in 2853) [ClassicSimilarity], result of:
          0.0045123748 = score(doc=2853,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.09528506 = fieldWeight in 2853, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=2853)
        0.027832389 = weight(_text_:k in 2853) [ClassicSimilarity], result of:
          0.027832389 = score(doc=2853,freq=2.0), product of:
            0.11761237 = queryWeight, product of:
              3.569778 = idf(docFreq=3384, maxDocs=44218)
              0.03294669 = queryNorm
            0.23664509 = fieldWeight in 2853, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.569778 = idf(docFreq=3384, maxDocs=44218)
              0.046875 = fieldNorm(doc=2853)
      0.33333334 = coord(2/6)
    
    Language
    e
  5. Liu, B.: Web data mining : exploring hyperlinks, contents, and usage data (2011) 0.01
    0.009749627 = product of:
      0.029248878 = sum of:
        0.00300825 = weight(_text_:e in 354) [ClassicSimilarity], result of:
          0.00300825 = score(doc=354,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.063523374 = fieldWeight in 354, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03125 = fieldNorm(doc=354)
        0.026240628 = weight(_text_:k in 354) [ClassicSimilarity], result of:
          0.026240628 = score(doc=354,freq=4.0), product of:
            0.11761237 = queryWeight, product of:
              3.569778 = idf(docFreq=3384, maxDocs=44218)
              0.03294669 = queryNorm
            0.22311112 = fieldWeight in 354, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.569778 = idf(docFreq=3384, maxDocs=44218)
              0.03125 = fieldNorm(doc=354)
      0.33333334 = coord(2/6)
    
    Classification
    TZG (FH K)
    GHBS
    TZG (FH K)
    Language
    e
  6. Wongthontham, P.; Abu-Salih, B.: Ontology-based approach for semantic data extraction from social big data : state-of-the-art and research directions (2018) 0.01
    0.009310008 = product of:
      0.027930023 = sum of:
        0.0045123748 = weight(_text_:e in 4097) [ClassicSimilarity], result of:
          0.0045123748 = score(doc=4097,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.09528506 = fieldWeight in 4097, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=4097)
        0.023417648 = weight(_text_:u in 4097) [ClassicSimilarity], result of:
          0.023417648 = score(doc=4097,freq=2.0), product of:
            0.107882105 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.03294669 = queryNorm
            0.21706703 = fieldWeight in 4097, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.046875 = fieldNorm(doc=4097)
      0.33333334 = coord(2/6)
    
    Language
    e
    Theme
    Semantisches Umfeld in Indexierung u. Retrieval
  7. Mandl, T.: Text mining und data mining (2013) 0.01
    0.0065049026 = product of:
      0.039029416 = sum of:
        0.039029416 = weight(_text_:u in 713) [ClassicSimilarity], result of:
          0.039029416 = score(doc=713,freq=2.0), product of:
            0.107882105 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.03294669 = queryNorm
            0.3617784 = fieldWeight in 713, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.078125 = fieldNorm(doc=713)
      0.16666667 = coord(1/6)
    
    Source
    Grundlagen der praktischen Information und Dokumentation. Handbuch zur Einführung in die Informationswissenschaft und -praxis. 6., völlig neu gefaßte Ausgabe. Hrsg. von R. Kuhlen, W. Semar u. D. Strauch. Begründet von Klaus Laisiepen, Ernst Lutterbeck, Karl-Heinrich Meyer-Uhlenried
  8. Mining text data (2012) 0.01
    0.006206672 = product of:
      0.018620016 = sum of:
        0.00300825 = weight(_text_:e in 362) [ClassicSimilarity], result of:
          0.00300825 = score(doc=362,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.063523374 = fieldWeight in 362, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03125 = fieldNorm(doc=362)
        0.015611766 = weight(_text_:u in 362) [ClassicSimilarity], result of:
          0.015611766 = score(doc=362,freq=2.0), product of:
            0.107882105 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.03294669 = queryNorm
            0.14471136 = fieldWeight in 362, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.03125 = fieldNorm(doc=362)
      0.33333334 = coord(2/6)
    
    Editor
    Aggarwal, C.C. u. C.X. Zhai
    Language
    e
  9. Liu, X.; Yu, S.; Janssens, F.; Glänzel, W.; Moreau, Y.; Moor, B.de: Weighted hybrid clustering by combining text mining and bibliometrics on a large-scale journal database (2010) 0.00
    0.0045070197 = product of:
      0.0135210585 = sum of:
        0.0045123748 = weight(_text_:e in 3464) [ClassicSimilarity], result of:
          0.0045123748 = score(doc=3464,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.09528506 = fieldWeight in 3464, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=3464)
        0.009008683 = product of:
          0.027026048 = sum of:
            0.027026048 = weight(_text_:29 in 3464) [ClassicSimilarity], result of:
              0.027026048 = score(doc=3464,freq=2.0), product of:
                0.11589616 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03294669 = queryNorm
                0.23319192 = fieldWeight in 3464, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3464)
          0.33333334 = coord(1/3)
      0.33333334 = coord(2/6)
    
    Date
    1. 6.2010 9:29:57
    Language
    e
  10. Qiu, X.Y.; Srinivasan, P.; Hu, Y.: Supervised learning models to predict firm performance with annual reports : an empirical study (2014) 0.00
    0.0045070197 = product of:
      0.0135210585 = sum of:
        0.0045123748 = weight(_text_:e in 1205) [ClassicSimilarity], result of:
          0.0045123748 = score(doc=1205,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.09528506 = fieldWeight in 1205, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=1205)
        0.009008683 = product of:
          0.027026048 = sum of:
            0.027026048 = weight(_text_:29 in 1205) [ClassicSimilarity], result of:
              0.027026048 = score(doc=1205,freq=2.0), product of:
                0.11589616 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03294669 = queryNorm
                0.23319192 = fieldWeight in 1205, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1205)
          0.33333334 = coord(1/3)
      0.33333334 = coord(2/6)
    
    Date
    29. 1.2014 16:46:40
    Language
    e
  11. Tu, Y.-N.; Hsu, S.-L.: Constructing conceptual trajectory maps to trace the development of research fields (2016) 0.00
    0.0037558496 = product of:
      0.011267548 = sum of:
        0.0037603125 = weight(_text_:e in 3059) [ClassicSimilarity], result of:
          0.0037603125 = score(doc=3059,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.07940422 = fieldWeight in 3059, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3059)
        0.007507236 = product of:
          0.022521708 = sum of:
            0.022521708 = weight(_text_:29 in 3059) [ClassicSimilarity], result of:
              0.022521708 = score(doc=3059,freq=2.0), product of:
                0.11589616 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03294669 = queryNorm
                0.19432661 = fieldWeight in 3059, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3059)
          0.33333334 = coord(1/3)
      0.33333334 = coord(2/6)
    
    Date
    21. 7.2016 19:29:19
    Language
    e
  12. Gill, A.J.; Hinrichs-Krapels, S.; Blanke, T.; Grant, J.; Hedges, M.; Tanner, S.: Insight workflow : systematically combining human and computational methods to explore textual data (2017) 0.00
    0.0037558496 = product of:
      0.011267548 = sum of:
        0.0037603125 = weight(_text_:e in 3682) [ClassicSimilarity], result of:
          0.0037603125 = score(doc=3682,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.07940422 = fieldWeight in 3682, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3682)
        0.007507236 = product of:
          0.022521708 = sum of:
            0.022521708 = weight(_text_:29 in 3682) [ClassicSimilarity], result of:
              0.022521708 = score(doc=3682,freq=2.0), product of:
                0.11589616 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03294669 = queryNorm
                0.19432661 = fieldWeight in 3682, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3682)
          0.33333334 = coord(1/3)
      0.33333334 = coord(2/6)
    
    Date
    16.11.2017 14:00:29
    Language
    e
  13. Hallonsten, O.; Holmberg, D.: Analyzing structural stratification in the Swedish higher education system : data contextualization with policy-history analysis (2013) 0.00
    0.0037333388 = product of:
      0.011200016 = sum of:
        0.0037603125 = weight(_text_:e in 668) [ClassicSimilarity], result of:
          0.0037603125 = score(doc=668,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.07940422 = fieldWeight in 668, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0390625 = fieldNorm(doc=668)
        0.0074397037 = product of:
          0.02231911 = sum of:
            0.02231911 = weight(_text_:22 in 668) [ClassicSimilarity], result of:
              0.02231911 = score(doc=668,freq=2.0), product of:
                0.1153737 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03294669 = queryNorm
                0.19345059 = fieldWeight in 668, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=668)
          0.33333334 = coord(1/3)
      0.33333334 = coord(2/6)
    
    Date
    22. 3.2013 19:43:01
    Language
    e
  14. Vaughan, L.; Chen, Y.: Data mining from web search queries : a comparison of Google trends and Baidu index (2015) 0.00
    0.0037333388 = product of:
      0.011200016 = sum of:
        0.0037603125 = weight(_text_:e in 1605) [ClassicSimilarity], result of:
          0.0037603125 = score(doc=1605,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.07940422 = fieldWeight in 1605, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1605)
        0.0074397037 = product of:
          0.02231911 = sum of:
            0.02231911 = weight(_text_:22 in 1605) [ClassicSimilarity], result of:
              0.02231911 = score(doc=1605,freq=2.0), product of:
                0.1153737 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03294669 = queryNorm
                0.19345059 = fieldWeight in 1605, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1605)
          0.33333334 = coord(1/3)
      0.33333334 = coord(2/6)
    
    Language
    e
    Source
    Journal of the Association for Information Science and Technology. 66(2015) no.1, S.13-22
  15. Fonseca, F.; Marcinkowski, M.; Davis, C.: Cyber-human systems of thought and understanding (2019) 0.00
    0.0037333388 = product of:
      0.011200016 = sum of:
        0.0037603125 = weight(_text_:e in 5011) [ClassicSimilarity], result of:
          0.0037603125 = score(doc=5011,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.07940422 = fieldWeight in 5011, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5011)
        0.0074397037 = product of:
          0.02231911 = sum of:
            0.02231911 = weight(_text_:22 in 5011) [ClassicSimilarity], result of:
              0.02231911 = score(doc=5011,freq=2.0), product of:
                0.1153737 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03294669 = queryNorm
                0.19345059 = fieldWeight in 5011, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5011)
          0.33333334 = coord(1/3)
      0.33333334 = coord(2/6)
    
    Date
    7. 3.2019 16:32:22
    Language
    e
  16. Jäger, L.: Von Big Data zu Big Brother (2018) 0.00
    0.0029866712 = product of:
      0.008960013 = sum of:
        0.00300825 = weight(_text_:e in 5234) [ClassicSimilarity], result of:
          0.00300825 = score(doc=5234,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.063523374 = fieldWeight in 5234, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03125 = fieldNorm(doc=5234)
        0.005951763 = product of:
          0.017855288 = sum of:
            0.017855288 = weight(_text_:22 in 5234) [ClassicSimilarity], result of:
              0.017855288 = score(doc=5234,freq=2.0), product of:
                0.1153737 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03294669 = queryNorm
                0.15476047 = fieldWeight in 5234, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=5234)
          0.33333334 = coord(1/3)
      0.33333334 = coord(2/6)
    
    Abstract
    In 1983 a single issue gripped the whole of the Federal Republic of Germany: the planned census. Every household in West Germany was to fill in questionnaires with 36 questions about its housing situation, the people living in the household, and their income. Massive resistance arose, and hundreds of citizens' initiatives formed across the country against the survey. People did not want to be "registered"; privacy was sacred. There was a (justified) concern that the answers on the supposedly anonymized questionnaires would allow conclusions to be drawn about the respondents' identities. The Federal Constitutional Court ruled in favor of the plaintiffs against the census: the planned census violated data protection and thus also the Basic Law. It was stopped. Only one generation later, we carelessly hand over the supermarket chain's loyalty card every time we shop in order to collect a few points toward a gift or discounts on the next purchase. And we know full well that the supermarket thereby learns our consumption behavior down to the last detail. What we do not know is who else gains access to these data. Their buyers get access not only to our purchases but can also use them to determine our habits, personal preferences, and income. Just as lightheartedly we surf the Internet, google and shop, email and chat. Google, Facebook, and Microsoft not only watch all of this but also store forever everything we reveal about ourselves, what we buy, and what we search for, and use it for their own purposes. They rummage through our e-mails, know our personal time management, track our current location, and know about our political, religious, and sexual preferences (who does not know the button "interested in men" or "interested in women"?), our closest friends with whom we are connected online, our relationship status, which school we attend or attended, and much more.
    Date
    22. 1.2018 11:33:49
  17. Blake, C.: Text mining (2011) 0.00
    0.0017548124 = product of:
      0.010528875 = sum of:
        0.010528875 = weight(_text_:e in 1599) [ClassicSimilarity], result of:
          0.010528875 = score(doc=1599,freq=2.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.2223318 = fieldWeight in 1599, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.109375 = fieldNorm(doc=1599)
      0.16666667 = coord(1/6)
    
    Language
    e
  18. Teich, E.; Degaetano-Ortlieb, S.; Fankhauser, P.; Kermes, H.; Lapshinova-Koltunski, E.: ¬The linguistic construal of disciplinarity : a data-mining approach using register features (2016) 0.00
    0.0013026106 = product of:
      0.007815664 = sum of:
        0.007815664 = weight(_text_:e in 3015) [ClassicSimilarity], result of:
          0.007815664 = score(doc=3015,freq=6.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.16503859 = fieldWeight in 3015, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=3015)
      0.16666667 = coord(1/6)
    
    Language
    e
  19. Sarnikar, S.; Zhang, Z.; Zhao, J.L.: Query-performance prediction for effective query routing in domain-specific repositories (2014) 0.00
    0.0010635771 = product of:
      0.0063814623 = sum of:
        0.0063814623 = weight(_text_:e in 1326) [ClassicSimilarity], result of:
          0.0063814623 = score(doc=1326,freq=4.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.13475344 = fieldWeight in 1326, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=1326)
      0.16666667 = coord(1/6)
    
    Abstract
    The effective use of corporate memory is becoming increasingly important because every aspect of e-business requires access to information repositories. Unfortunately, less-than-satisfying effectiveness in state-of-the-art information-retrieval techniques is well known, even for some of the best search engines such as Google. In this study, the authors resolve this retrieval ineffectiveness problem by developing a new framework for predicting query performance, which is the first step toward better retrieval effectiveness. Specifically, they examine the relationship between query performance and query context. A query context consists of the query itself, the document collection, and the interaction between the two. The authors first analyze the characteristics of query context and develop various features for predicting query performance. Then, they propose a context-sensitive model for predicting query performance based on the characteristics of the query and the document collection. Finally, they validate this model with respect to five real-world collections of documents and demonstrate its utility in routing queries to the correct repository with high accuracy.
    Language
    e
  20. Bella, A. La; Fronzetti Colladon, A.; Battistoni, E.; Castellan, S.; Francucci, M.: Assessing perceived organizational leadership styles through twitter text mining (2018) 0.00
    0.0010635771 = product of:
      0.0063814623 = sum of:
        0.0063814623 = weight(_text_:e in 2400) [ClassicSimilarity], result of:
          0.0063814623 = score(doc=2400,freq=4.0), product of:
            0.047356583 = queryWeight, product of:
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.03294669 = queryNorm
            0.13475344 = fieldWeight in 2400, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.43737 = idf(docFreq=28552, maxDocs=44218)
              0.046875 = fieldNorm(doc=2400)
      0.16666667 = coord(1/6)
    
    Language
    e
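
The score explanations above all follow the same arithmetic, so they can be re-derived from the numbers printed in the listing. The following is a minimal sketch, assuming the classic Lucene TF-IDF formulas that are consistent with the reported values (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function and variable names are illustrative and not part of the search system. It recomputes the scores of results 7 and 1 in double precision, so the last digits may differ slightly from the single-precision values shown.

```python
import math

MAX_DOCS = 44218          # maxDocs, as printed in every idf(...) line
QUERY_NORM = 0.03294669   # queryNorm, as printed in the listing


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency (assumed formula: 1 + ln(maxDocs / (docFreq + 1)))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm, query_norm=QUERY_NORM):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = term_idf * query_norm                    # idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# Result 7 (doc 713): only the term "u" matches, so coord = 1/6.
score_713 = term_score(2.0, 4547, 0.078125) * (1 / 6)

# Result 1 (doc 967): terms "e" and "k" match directly; the term "29"
# sits in a nested clause with its own coord(1/3). Three of the six
# top-level clauses match, so the outer factor is coord(3/6).
clauses_967 = [
    term_score(2.0, 28552, 0.0390625),            # _text_:e
    term_score(2.0, 3384, 0.0390625),             # _text_:k
    term_score(2.0, 3565, 0.0390625) * (1 / 3),   # _text_:29, nested coord
]
score_967 = sum(clauses_967) * (3 / 6)

print(f"doc 713: {score_713:.9f}  (listing reports 0.0065049026)")
print(f"doc 967: {score_967:.9f}  (listing reports 0.017230602)")
```

The coord factors explain why several records with a comparatively high single-term weight still rank low: a record matching one clause out of six keeps only a sixth of its summed term scores.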

Languages

  • e 47
  • d 3

Types

  • a 48
  • el 8
  • m 2
  • s 1