Document (#43878)

Author
Wiegmann, S.
Title
Hättest du die Titanic überlebt? : Eine kurze Einführung in das Data Mining mit freier Software
Source
API Magazin. 4(2023), Nr.1 [https://journals.sub.uni-hamburg.de/hup3/apimagazin/article/view/130]
Year
2023
Abstract
Am 10. April 1912 ging Elisabeth Walton Allen an Bord der "Titanic", um ihr Hab und Gut nach England zu holen. Eines Nachts wurde sie von ihrer aufgelösten Tante geweckt, deren Kajüte unter Wasser stand. Wie steht es um Elisabeths Chancen und hätte man selbst das Unglück damals überlebt? Das Titanic-Orakel ist eine algorithmusbasierte App, die entsprechende Prognosen aufstellt und im Rahmen des Kurses "Data Science" am Department Information der HAW Hamburg entstanden ist. Dieser Beitrag zeigt Schritt für Schritt, wie die App unter Verwendung freier Software entwickelt wurde. Code und Daten werden zur Nachnutzung bereitgestellt.
Content
Vgl.: https://doi.org/10.15460/apimagazin.2023.4.1.130.
Theme
Data Mining

Similar documents (content)

  1. Lüttcher, B.; Zendel, O.: ¬Eine kurze Geschichte Freier Software : Interview mit Oliver Zendel (2005) 0.17
    0.17122084 = sum of:
      0.17122084 = product of:
        0.611503 = sum of:
          0.013022195 = weight(abstract_txt:eine in 3503) [ClassicSimilarity], result of:
            0.013022195 = score(doc=3503,freq=1.0), product of:
              0.059714075 = queryWeight, product of:
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.017113911 = queryNorm
              0.2180758 = fieldWeight in 3503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.0625 = fieldNorm(doc=3503)
          0.052698296 = weight(abstract_txt:entstanden in 3503) [ClassicSimilarity], result of:
            0.052698296 = score(doc=3503,freq=1.0), product of:
              0.120358005 = queryWeight, product of:
                1.0038854 = boost
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.017113911 = queryNorm
              0.4378462 = fieldWeight in 3503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.0625 = fieldNorm(doc=3503)
          0.06370506 = weight(abstract_txt:damals in 3503) [ClassicSimilarity], result of:
            0.06370506 = score(doc=3503,freq=1.0), product of:
              0.13658191 = queryWeight, product of:
                1.0694076 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.017113911 = queryNorm
              0.4664238 = fieldWeight in 3503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.0625 = fieldNorm(doc=3503)
          0.062066395 = weight(abstract_txt:software in 3503) [ClassicSimilarity], result of:
            0.062066395 = score(doc=3503,freq=6.0), product of:
              0.0930696 = queryWeight, product of:
                1.248434 = boost
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.017113911 = queryNorm
              0.6668815 = fieldWeight in 3503, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.0625 = fieldNorm(doc=3503)
          0.05767273 = weight(abstract_txt:wurde in 3503) [ClassicSimilarity], result of:
            0.05767273 = score(doc=3503,freq=3.0), product of:
              0.111659035 = queryWeight, product of:
                1.3674409 = boost
                4.771292 = idf(docFreq=1017, maxDocs=44218)
                0.017113911 = queryNorm
              0.5165075 = fieldWeight in 3503, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.771292 = idf(docFreq=1017, maxDocs=44218)
                0.0625 = fieldNorm(doc=3503)
          0.04747114 = weight(abstract_txt:unter in 3503) [ClassicSimilarity], result of:
            0.04747114 = score(doc=3503,freq=2.0), product of:
              0.11226138 = queryWeight, product of:
                1.3711243 = boost
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.017113911 = queryNorm
              0.42286262 = fieldWeight in 3503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.0625 = fieldNorm(doc=3503)
          0.31486717 = weight(abstract_txt:freier in 3503) [ClassicSimilarity], result of:
            0.31486717 = score(doc=3503,freq=2.0), product of:
              0.39630437 = queryWeight, product of:
                2.5761793 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.017113911 = queryNorm
              0.79450846 = fieldWeight in 3503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=3503)
        0.28 = coord(7/25)
    
  2. Lützenkirchen, F.: Multimediale Dokumentenserver als E-Learning Content Repository (2006) 0.10
    0.09984051 = sum of:
      0.09984051 = product of:
        0.41600212 = sum of:
          0.018416164 = weight(abstract_txt:eine in 6050) [ClassicSimilarity], result of:
            0.018416164 = score(doc=6050,freq=2.0), product of:
              0.059714075 = queryWeight, product of:
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.017113911 = queryNorm
              0.30840576 = fieldWeight in 6050, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.0625 = fieldNorm(doc=6050)
          0.052698296 = weight(abstract_txt:entstanden in 6050) [ClassicSimilarity], result of:
            0.052698296 = score(doc=6050,freq=1.0), product of:
              0.120358005 = queryWeight, product of:
                1.0038854 = boost
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.017113911 = queryNorm
              0.4378462 = fieldWeight in 6050, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.0625 = fieldNorm(doc=6050)
          0.06333728 = weight(abstract_txt:hamburg in 6050) [ClassicSimilarity], result of:
            0.06333728 = score(doc=6050,freq=1.0), product of:
              0.13605574 = queryWeight, product of:
                1.0673456 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.017113911 = queryNorm
              0.4655245 = fieldWeight in 6050, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0625 = fieldNorm(doc=6050)
          0.025338497 = weight(abstract_txt:software in 6050) [ClassicSimilarity], result of:
            0.025338497 = score(doc=6050,freq=1.0), product of:
              0.0930696 = queryWeight, product of:
                1.248434 = boost
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.017113911 = queryNorm
              0.27225322 = fieldWeight in 6050, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.0625 = fieldNorm(doc=6050)
          0.033567164 = weight(abstract_txt:unter in 6050) [ClassicSimilarity], result of:
            0.033567164 = score(doc=6050,freq=1.0), product of:
              0.11226138 = queryWeight, product of:
                1.3711243 = boost
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.017113911 = queryNorm
              0.29900903 = fieldWeight in 6050, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.0625 = fieldNorm(doc=6050)
          0.22264472 = weight(abstract_txt:freier in 6050) [ClassicSimilarity], result of:
            0.22264472 = score(doc=6050,freq=1.0), product of:
              0.39630437 = queryWeight, product of:
                2.5761793 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.017113911 = queryNorm
              0.5618023 = fieldWeight in 6050, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=6050)
        0.24 = coord(6/25)
    
  3. Lischka, K.: 128 Zeichen für die Welt : Vor 40 Jahren schrieben Fachleute das Alphabet des Computers - und schufen damit dem ASCII-Standard (2003) 0.06
    0.0634931 = sum of:
      0.0634931 = product of:
        0.2645546 = sum of:
          0.017226744 = weight(abstract_txt:eine in 391) [ClassicSimilarity], result of:
            0.017226744 = score(doc=391,freq=7.0), product of:
              0.059714075 = queryWeight, product of:
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.017113911 = queryNorm
              0.28848717 = fieldWeight in 391, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.03125 = fieldNorm(doc=391)
          0.026349148 = weight(abstract_txt:entstanden in 391) [ClassicSimilarity], result of:
            0.026349148 = score(doc=391,freq=1.0), product of:
              0.120358005 = queryWeight, product of:
                1.0038854 = boost
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.017113911 = queryNorm
              0.2189231 = fieldWeight in 391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.03125 = fieldNorm(doc=391)
          0.021943782 = weight(abstract_txt:software in 391) [ClassicSimilarity], result of:
            0.021943782 = score(doc=391,freq=3.0), product of:
              0.0930696 = queryWeight, product of:
                1.248434 = boost
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.017113911 = queryNorm
              0.2357782 = fieldWeight in 391, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.03125 = fieldNorm(doc=391)
          0.033297367 = weight(abstract_txt:wurde in 391) [ClassicSimilarity], result of:
            0.033297367 = score(doc=391,freq=4.0), product of:
              0.111659035 = queryWeight, product of:
                1.3674409 = boost
                4.771292 = idf(docFreq=1017, maxDocs=44218)
                0.017113911 = queryNorm
              0.29820576 = fieldWeight in 391, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.771292 = idf(docFreq=1017, maxDocs=44218)
                0.03125 = fieldNorm(doc=391)
          0.016783582 = weight(abstract_txt:unter in 391) [ClassicSimilarity], result of:
            0.016783582 = score(doc=391,freq=1.0), product of:
              0.11226138 = queryWeight, product of:
                1.3711243 = boost
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.017113911 = queryNorm
              0.14950451 = fieldWeight in 391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.03125 = fieldNorm(doc=391)
          0.14895397 = weight(abstract_txt:überlebt in 391) [ClassicSimilarity], result of:
            0.14895397 = score(doc=391,freq=1.0), product of:
              0.48121813 = queryWeight, product of:
                2.838786 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.017113911 = queryNorm
              0.30953524 = fieldWeight in 391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.03125 = fieldNorm(doc=391)
        0.24 = coord(6/25)
    
  4. Stoyan, H.: Information in der Informatik (2004) 0.06
    0.06270534 = sum of:
      0.06270534 = product of:
        0.26127225 = sum of:
          0.016916327 = weight(abstract_txt:eine in 2959) [ClassicSimilarity], result of:
            0.016916327 = score(doc=2959,freq=3.0), product of:
              0.059714075 = queryWeight, product of:
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.017113911 = queryNorm
              0.28328878 = fieldWeight in 2959, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.046875 = fieldNorm(doc=2959)
          0.042855017 = weight(abstract_txt:entsprechende in 2959) [ClassicSimilarity], result of:
            0.042855017 = score(doc=2959,freq=1.0), product of:
              0.12702939 = queryWeight, product of:
                1.0313326 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.017113911 = queryNorm
              0.337363 = fieldWeight in 2959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.046875 = fieldNorm(doc=2959)
          0.047778793 = weight(abstract_txt:damals in 2959) [ClassicSimilarity], result of:
            0.047778793 = score(doc=2959,freq=1.0), product of:
              0.13658191 = queryWeight, product of:
                1.0694076 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.017113911 = queryNorm
              0.34981787 = fieldWeight in 2959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.046875 = fieldNorm(doc=2959)
          0.047778793 = weight(abstract_txt:ging in 2959) [ClassicSimilarity], result of:
            0.047778793 = score(doc=2959,freq=1.0), product of:
              0.13658191 = queryWeight, product of:
                1.0694076 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.017113911 = queryNorm
              0.34981787 = fieldWeight in 2959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.046875 = fieldNorm(doc=2959)
          0.055997264 = weight(abstract_txt:england in 2959) [ClassicSimilarity], result of:
            0.055997264 = score(doc=2959,freq=1.0), product of:
              0.15182652 = queryWeight, product of:
                1.1275102 = boost
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.017113911 = queryNorm
              0.368824 = fieldWeight in 2959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.046875 = fieldNorm(doc=2959)
          0.049946055 = weight(abstract_txt:wurde in 2959) [ClassicSimilarity], result of:
            0.049946055 = score(doc=2959,freq=4.0), product of:
              0.111659035 = queryWeight, product of:
                1.3674409 = boost
                4.771292 = idf(docFreq=1017, maxDocs=44218)
                0.017113911 = queryNorm
              0.44730866 = fieldWeight in 2959, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.771292 = idf(docFreq=1017, maxDocs=44218)
                0.046875 = fieldNorm(doc=2959)
        0.24 = coord(6/25)
    
  5. Wiesenmüller, H.: Zehn Jahre 'Functional Requirements for Bibliographic Records' (FRBR) : Vision, Theorie und praktische Anwendung (2008) 0.06
    0.060429797 = sum of:
      0.060429797 = product of:
        0.25179082 = sum of:
          0.021838885 = weight(abstract_txt:eine in 2616) [ClassicSimilarity], result of:
            0.021838885 = score(doc=2616,freq=5.0), product of:
              0.059714075 = queryWeight, product of:
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.017113911 = queryNorm
              0.36572424 = fieldWeight in 2616, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.046875 = fieldNorm(doc=2616)
          0.039679926 = weight(abstract_txt:chancen in 2616) [ClassicSimilarity], result of:
            0.039679926 = score(doc=2616,freq=1.0), product of:
              0.12067491 = queryWeight, product of:
                1.0052061 = boost
                7.014756 = idf(docFreq=107, maxDocs=44218)
                0.017113911 = queryNorm
              0.3288167 = fieldWeight in 2616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.014756 = idf(docFreq=107, maxDocs=44218)
                0.046875 = fieldNorm(doc=2616)
          0.047778793 = weight(abstract_txt:ging in 2616) [ClassicSimilarity], result of:
            0.047778793 = score(doc=2616,freq=1.0), product of:
              0.13658191 = queryWeight, product of:
                1.0694076 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.017113911 = queryNorm
              0.34981787 = fieldWeight in 2616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.046875 = fieldNorm(doc=2616)
          0.03531719 = weight(abstract_txt:wurde in 2616) [ClassicSimilarity], result of:
            0.03531719 = score(doc=2616,freq=2.0), product of:
              0.111659035 = queryWeight, product of:
                1.3674409 = boost
                4.771292 = idf(docFreq=1017, maxDocs=44218)
                0.017113911 = queryNorm
              0.31629497 = fieldWeight in 2616, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.771292 = idf(docFreq=1017, maxDocs=44218)
                0.046875 = fieldNorm(doc=2616)
          0.025175374 = weight(abstract_txt:unter in 2616) [ClassicSimilarity], result of:
            0.025175374 = score(doc=2616,freq=1.0), product of:
              0.11226138 = queryWeight, product of:
                1.3711243 = boost
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.017113911 = queryNorm
              0.22425677 = fieldWeight in 2616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.046875 = fieldNorm(doc=2616)
          0.08200065 = weight(abstract_txt:schritt in 2616) [ClassicSimilarity], result of:
            0.08200065 = score(doc=2616,freq=1.0), product of:
              0.2466747 = queryWeight, product of:
                2.032469 = boost
                7.0917172 = idf(docFreq=99, maxDocs=44218)
                0.017113911 = queryNorm
              0.33242425 = fieldWeight in 2616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0917172 = idf(docFreq=99, maxDocs=44218)
                0.046875 = fieldNorm(doc=2616)
        0.24 = coord(6/25)