Document (#40888)

Author
Wattenberg, M.
Viégas, F.
Johnson, I.
Title
How to use t-SNE effectively
Source
Distill, [http://doi.org/10.23915/distill.00002]
Year
2016
Abstract
Although extremely useful for visualizing high-dimensional data, t-SNE plots can sometimes be mysterious or misleading. By exploring how it behaves in simple cases, we can learn to use it more effectively. We'll walk through a series of simple examples to illustrate what t-SNE diagrams can and cannot show. The t-SNE technique really is useful, but only if you know how to interpret it.
Content
Cf.: https://distill.pub/2016/misread-tsne/.
Theme
Data Mining
Visualization
Object
tSNE

Similar documents (author)

  1. Johnson, S.W.: Do-it-yourself CD-ROMs (1992) 4.57
    4.5717874 = sum of:
      4.5717874 = weight(author_txt:johnson in 4285) [ClassicSimilarity], result of:
        4.5717874 = score(doc=4285,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.314861 = idf(docFreq=79, maxDocs=44218)
            0.13670799 = queryNorm
          4.571788 = fieldWeight in 4285, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.314861 = idf(docFreq=79, maxDocs=44218)
            0.625 = fieldNorm(doc=4285)
    
  2. Johnson, S.: Virtual documents : the past, the present and some standards for the future (1993) 4.57
    4.5717874 = sum of:
      4.5717874 = weight(author_txt:johnson in 4421) [ClassicSimilarity], result of:
        4.5717874 = score(doc=4421,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.314861 = idf(docFreq=79, maxDocs=44218)
            0.13670799 = queryNorm
          4.571788 = fieldWeight in 4421, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.314861 = idf(docFreq=79, maxDocs=44218)
            0.625 = fieldNorm(doc=4421)
    
  3. Johnson, R.D.: Public libraries and the Internet / NREN : new challenges, new opportunities (1992) 4.57
    4.5717874 = sum of:
      4.5717874 = weight(author_txt:johnson in 6248) [ClassicSimilarity], result of:
        4.5717874 = score(doc=6248,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.314861 = idf(docFreq=79, maxDocs=44218)
            0.13670799 = queryNorm
          4.571788 = fieldWeight in 6248, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.314861 = idf(docFreq=79, maxDocs=44218)
            0.625 = fieldNorm(doc=6248)
    
  4. Johnson, F.C.: A classification of ellipsis based on a corpus of information seeking dialogues (1994) 4.57
    4.5717874 = sum of:
      4.5717874 = weight(author_txt:johnson in 7803) [ClassicSimilarity], result of:
        4.5717874 = score(doc=7803,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.314861 = idf(docFreq=79, maxDocs=44218)
            0.13670799 = queryNorm
          4.571788 = fieldWeight in 7803, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.314861 = idf(docFreq=79, maxDocs=44218)
            0.625 = fieldNorm(doc=7803)
    
  5. Johnson, A.: Information brokers (1991) 4.57
    4.5717874 = sum of:
      4.5717874 = weight(author_txt:johnson in 1294) [ClassicSimilarity], result of:
        4.5717874 = score(doc=1294,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.314861 = idf(docFreq=79, maxDocs=44218)
            0.13670799 = queryNorm
          4.571788 = fieldWeight in 1294, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.314861 = idf(docFreq=79, maxDocs=44218)
            0.625 = fieldNorm(doc=1294)
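
The score trees above can be checked by hand. The following is a minimal Python sketch, assuming the classic Lucene TF-IDF relations that the listed numbers are consistent with: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, and fieldWeight = tf * idf * fieldNorm. The function names are hypothetical and are not part of any Lucene API; queryNorm and fieldNorm are copied from the listing rather than derived.

```python
import math


def idf(doc_freq, max_docs):
    # Assumed form, inferred from the listing: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    # Assumed form: square root of the raw term frequency.
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, field_norm, query_norm, boost=1.0):
    """Score of a single term match: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


# Values copied from the author_txt:johnson entries above.
johnson_idf = idf(doc_freq=79, max_docs=44218)
johnson_score = term_score(
    freq=1.0,
    doc_freq=79,
    max_docs=44218,
    field_norm=0.625,
    query_norm=0.13670799,
)
print(johnson_idf, johnson_score)
```

Because all five author matches share the same freq, idf, queryNorm, and fieldNorm, they all receive the same score, which is why every entry in this list shows 4.57.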
    

Similar documents (content)

  1. Maaten, L. van den: Accelerating t-SNE using Tree-Based Algorithms (2014) 0.12
    0.115957595 = sum of:
      0.115957595 = product of:
        0.57978797 = sum of:
          0.04169794 = weight(abstract_txt:high in 3886) [ClassicSimilarity], result of:
            0.04169794 = score(doc=3886,freq=1.0), product of:
              0.09152651 = queryWeight, product of:
                1.0157624 = boost
                4.8595543 = idf(docFreq=931, maxDocs=44218)
                0.018542074 = queryNorm
              0.4555832 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8595543 = idf(docFreq=931, maxDocs=44218)
                0.09375 = fieldNorm(doc=3886)
          0.06346424 = weight(abstract_txt:technique in 3886) [ClassicSimilarity], result of:
            0.06346424 = score(doc=3886,freq=1.0), product of:
              0.12110346 = queryWeight, product of:
                1.1684146 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.018542074 = queryNorm
              0.52404976 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.09375 = fieldNorm(doc=3886)
          0.09002591 = weight(abstract_txt:learn in 3886) [ClassicSimilarity], result of:
            0.09002591 = score(doc=3886,freq=1.0), product of:
              0.15289108 = queryWeight, product of:
                1.3128339 = boost
                6.280787 = idf(docFreq=224, maxDocs=44218)
                0.018542074 = queryNorm
              0.5888238 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.280787 = idf(docFreq=224, maxDocs=44218)
                0.09375 = fieldNorm(doc=3886)
          0.11457877 = weight(abstract_txt:dimensional in 3886) [ClassicSimilarity], result of:
            0.11457877 = score(doc=3886,freq=1.0), product of:
              0.17955875 = queryWeight, product of:
                1.4227284 = boost
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.018542074 = queryNorm
              0.63811296 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.09375 = fieldNorm(doc=3886)
          0.2700211 = weight(abstract_txt:plots in 3886) [ClassicSimilarity], result of:
            0.2700211 = score(doc=3886,freq=1.0), product of:
              0.31798184 = queryWeight, product of:
                1.893302 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.018542074 = queryNorm
              0.8491715 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.09375 = fieldNorm(doc=3886)
        0.2 = coord(5/25)
    
  2. Maaten, L. van den; Hinton, G.: Visualizing data using t-SNE (2008) 0.08
    0.08226959 = sum of:
      0.08226959 = product of:
        0.41134793 = sum of:
          0.039313193 = weight(abstract_txt:high in 3888) [ClassicSimilarity], result of:
            0.039313193 = score(doc=3888,freq=2.0), product of:
              0.09152651 = queryWeight, product of:
                1.0157624 = boost
                4.8595543 = idf(docFreq=931, maxDocs=44218)
                0.018542074 = queryNorm
              0.42952797 = fieldWeight in 3888, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8595543 = idf(docFreq=931, maxDocs=44218)
                0.0625 = fieldNorm(doc=3888)
          0.059834655 = weight(abstract_txt:technique in 3888) [ClassicSimilarity], result of:
            0.059834655 = score(doc=3888,freq=2.0), product of:
              0.12110346 = queryWeight, product of:
                1.1684146 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.018542074 = queryNorm
              0.49407884 = fieldWeight in 3888, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.0625 = fieldNorm(doc=3888)
          0.053679723 = weight(abstract_txt:illustrate in 3888) [ClassicSimilarity], result of:
            0.053679723 = score(doc=3888,freq=1.0), product of:
              0.14192912 = queryWeight, product of:
                1.264895 = boost
                6.0514402 = idf(docFreq=282, maxDocs=44218)
                0.018542074 = queryNorm
              0.37821501 = fieldWeight in 3888, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0514402 = idf(docFreq=282, maxDocs=44218)
                0.0625 = fieldNorm(doc=3888)
          0.15277168 = weight(abstract_txt:dimensional in 3888) [ClassicSimilarity], result of:
            0.15277168 = score(doc=3888,freq=4.0), product of:
              0.17955875 = queryWeight, product of:
                1.4227284 = boost
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.018542074 = queryNorm
              0.85081726 = fieldWeight in 3888, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.0625 = fieldNorm(doc=3888)
          0.10574865 = weight(abstract_txt:visualizing in 3888) [ClassicSimilarity], result of:
            0.10574865 = score(doc=3888,freq=1.0), product of:
              0.22303921 = queryWeight, product of:
                1.5856572 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.018542074 = queryNorm
              0.47412583 = fieldWeight in 3888, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0625 = fieldNorm(doc=3888)
        0.2 = coord(5/25)
    
  3. Hochheiser, H.; Shneiderman, B.: Using interactive visualizations of WWW log data to characterize access patterns and inform site design (2001) 0.08
    0.07811444 = sum of:
      0.07811444 = product of:
        0.3905722 = sum of:
          0.0331556 = weight(abstract_txt:although in 5765) [ClassicSimilarity], result of:
            0.0331556 = score(doc=5765,freq=1.0), product of:
              0.08870796 = queryWeight, product of:
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.018542074 = queryNorm
              0.3737613 = fieldWeight in 5765, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.078125 = fieldNorm(doc=5765)
          0.050389152 = weight(abstract_txt:series in 5765) [ClassicSimilarity], result of:
            0.050389152 = score(doc=5765,freq=1.0), product of:
              0.11725985 = queryWeight, product of:
                1.1497234 = boost
                5.500443 = idf(docFreq=490, maxDocs=44218)
                0.018542074 = queryNorm
              0.4297221 = fieldWeight in 5765, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.500443 = idf(docFreq=490, maxDocs=44218)
                0.078125 = fieldNorm(doc=5765)
          0.09548231 = weight(abstract_txt:dimensional in 5765) [ClassicSimilarity], result of:
            0.09548231 = score(doc=5765,freq=1.0), product of:
              0.17955875 = queryWeight, product of:
                1.4227284 = boost
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.018542074 = queryNorm
              0.5317608 = fieldWeight in 5765, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.078125 = fieldNorm(doc=5765)
          0.11448151 = weight(abstract_txt:interpret in 5765) [ClassicSimilarity], result of:
            0.11448151 = score(doc=5765,freq=1.0), product of:
              0.20265074 = queryWeight, product of:
                1.5114466 = boost
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.018542074 = queryNorm
              0.56492025 = fieldWeight in 5765, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.078125 = fieldNorm(doc=5765)
          0.09706359 = weight(abstract_txt:useful in 5765) [ClassicSimilarity], result of:
            0.09706359 = score(doc=5765,freq=2.0), product of:
              0.18153578 = queryWeight, product of:
                2.0230882 = boost
                4.839373 = idf(docFreq=950, maxDocs=44218)
                0.018542074 = queryNorm
              0.53468025 = fieldWeight in 5765, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.839373 = idf(docFreq=950, maxDocs=44218)
                0.078125 = fieldNorm(doc=5765)
        0.2 = coord(5/25)
    
  4. Schamber, L.: Time-line interviews and inductive content analysis : their effectiveness for exploring cognitive behaviors (2000) 0.07
    0.0654321 = sum of:
      0.0654321 = product of:
        0.40895063 = sum of:
          0.05259475 = weight(abstract_txt:examples in 4808) [ClassicSimilarity], result of:
            0.05259475 = score(doc=4808,freq=1.0), product of:
              0.096412696 = queryWeight, product of:
                1.0425234 = boost
                4.9875827 = idf(docFreq=819, maxDocs=44218)
                0.018542074 = queryNorm
              0.54551685 = fieldWeight in 4808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9875827 = idf(docFreq=819, maxDocs=44218)
                0.109375 = fieldNorm(doc=4808)
          0.10223742 = weight(abstract_txt:exploring in 4808) [ClassicSimilarity], result of:
            0.10223742 = score(doc=4808,freq=1.0), product of:
              0.15016863 = queryWeight, product of:
                1.301093 = boost
                6.2246165 = idf(docFreq=237, maxDocs=44218)
                0.018542074 = queryNorm
              0.6808174 = fieldWeight in 4808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2246165 = idf(docFreq=237, maxDocs=44218)
                0.109375 = fieldNorm(doc=4808)
          0.1580304 = weight(abstract_txt:extremely in 4808) [ClassicSimilarity], result of:
            0.1580304 = score(doc=4808,freq=1.0), product of:
              0.20075502 = queryWeight, product of:
                1.5043604 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.018542074 = queryNorm
              0.78718036 = fieldWeight in 4808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.109375 = fieldNorm(doc=4808)
          0.09608805 = weight(abstract_txt:useful in 4808) [ClassicSimilarity], result of:
            0.09608805 = score(doc=4808,freq=1.0), product of:
              0.18153578 = queryWeight, product of:
                2.0230882 = boost
                4.839373 = idf(docFreq=950, maxDocs=44218)
                0.018542074 = queryNorm
              0.5293064 = fieldWeight in 4808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.839373 = idf(docFreq=950, maxDocs=44218)
                0.109375 = fieldNorm(doc=4808)
        0.16 = coord(4/25)
    
  5. Costa Carvalho, A. da; Rossi, C.; Moura, E.S. de; Silva, A.S. da; Fernandes, D.: LePrEF: Learn to precompute evidence fusion for efficient query evaluation (2012) 0.06
    0.06296476 = sum of:
      0.06296476 = product of:
        0.31482378 = sum of:
          0.039313193 = weight(abstract_txt:high in 278) [ClassicSimilarity], result of:
            0.039313193 = score(doc=278,freq=2.0), product of:
              0.09152651 = queryWeight, product of:
                1.0157624 = boost
                4.8595543 = idf(docFreq=931, maxDocs=44218)
                0.018542074 = queryNorm
              0.42952797 = fieldWeight in 278, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8595543 = idf(docFreq=931, maxDocs=44218)
                0.0625 = fieldNorm(doc=278)
          0.042309493 = weight(abstract_txt:technique in 278) [ClassicSimilarity], result of:
            0.042309493 = score(doc=278,freq=1.0), product of:
              0.12110346 = queryWeight, product of:
                1.1684146 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.018542074 = queryNorm
              0.34936652 = fieldWeight in 278, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.0625 = fieldNorm(doc=278)
          0.06001727 = weight(abstract_txt:learn in 278) [ClassicSimilarity], result of:
            0.06001727 = score(doc=278,freq=1.0), product of:
              0.15289108 = queryWeight, product of:
                1.3128339 = boost
                6.280787 = idf(docFreq=224, maxDocs=44218)
                0.018542074 = queryNorm
              0.3925492 = fieldWeight in 278, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.280787 = idf(docFreq=224, maxDocs=44218)
                0.0625 = fieldNorm(doc=278)
          0.073022194 = weight(abstract_txt:simple in 278) [ClassicSimilarity], result of:
            0.073022194 = score(doc=278,freq=1.0), product of:
              0.21953878 = queryWeight, product of:
                2.2247915 = boost
                5.321862 = idf(docFreq=586, maxDocs=44218)
                0.018542074 = queryNorm
              0.3326164 = fieldWeight in 278, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.321862 = idf(docFreq=586, maxDocs=44218)
                0.0625 = fieldNorm(doc=278)
          0.10016162 = weight(abstract_txt:effectively in 278) [ClassicSimilarity], result of:
            0.10016162 = score(doc=278,freq=1.0), product of:
              0.2710247 = queryWeight, product of:
                2.4719412 = boost
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.018542074 = queryNorm
              0.36956638 = fieldWeight in 278, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.0625 = fieldNorm(doc=278)
        0.2 = coord(5/25)
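
In the content-based list, each document's final score is the sum of its matching term scores multiplied by a coordination factor, shown as coord(matched/total). A small Python sketch of that last step follows, using the five term scores listed for entry 1 (doc 3886) and the four listed for entry 4 (doc 4808). The helper name is hypothetical, and the total of 25 query terms is read off the coord lines in the listing.

```python
def combined_score(term_scores, total_query_terms):
    """Sum the per-term scores, then scale by the fraction of query terms matched."""
    coord = len(term_scores) / total_query_terms
    return sum(term_scores) * coord


# Per-term scores copied from the listing for doc 3886 (5 of 25 terms matched).
doc_3886_terms = [0.04169794, 0.06346424, 0.09002591, 0.11457877, 0.2700211]
doc_3886_score = combined_score(doc_3886_terms, total_query_terms=25)

# Per-term scores copied from the listing for doc 4808 (4 of 25 terms matched).
doc_4808_terms = [0.05259475, 0.10223742, 0.1580304, 0.09608805]
doc_4808_score = combined_score(doc_4808_terms, total_query_terms=25)

print(doc_3886_score, doc_4808_score)
```

The coordination factor explains why doc 4808 ranks below doc 5765 even though its raw term sum is slightly higher: it matches only four of the 25 query terms instead of five.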