Document (#38163)

Author
Landauer, T.K.
Foltz, P.W.
Laham, D.
Title
¬An introduction to Latent Semantic Analysis
Source
Discourse Processes. 25(1998), S.259-284. [http://lsa.colorado.edu/papers/dp1.LSAintro.pdf]
Year
1998
Abstract
Latent Semantic Analysis (LSA) is a theory and method for extracting and representing the contextual-usage meaning of words by statistical computations applied to a large corpus of text (Landauer and Dumais, 1997). The underlying idea is that the aggregate of all the word contexts in which a given word does and does not appear provides a set of mutual constraints that largely determines the similarity of meaning of words and sets of words to each other. The adequacy of LSA's reflection of human knowledge has been established in a variety of ways. For example, its scores overlap those of humans on standard vocabulary and subject matter tests; it mimics human word sorting and category judgments; it simulates word-word and passage-word lexical priming data; and as reported in 3 following articles in this issue, it accurately estimates passage coherence, learnability of passages by individual students, and the quality and quantity of knowledge contained in an essay.
Theme
Semantisches Umfeld in Indexierung u. Retrieval
Object
Latent Semantic Indexing

Similar documents (author)

  1. Furnas, G.W.; Landauer, T.K.: Describing categories of objects for menu retrieval systems (1984) 4.88
    4.8754888 = sum of:
      4.8754888 = weight(author_txt:landauer in 6507) [ClassicSimilarity], result of:
        4.8754888 = fieldWeight in 6507, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.5 = fieldNorm(doc=6507)
    
  2. Gomez, L.; Lochbaum, C.C.; Landauer, T.K.: All the right words: finding what you want as an function of richness of indexing vocabulary (1990) 3.66
    3.6566167 = sum of:
      3.6566167 = weight(author_txt:landauer in 154) [ClassicSimilarity], result of:
        3.6566167 = fieldWeight in 154, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.375 = fieldNorm(doc=154)
    
  3. Furnas, G.W.; Landauer, T.K.; Gomez, L.M.; Dumais, S.T.: ¬The vocabulary problem in human-system communication (1987) 3.05
    3.0471804 = sum of:
      3.0471804 = weight(author_txt:landauer in 7629) [ClassicSimilarity], result of:
        3.0471804 = fieldWeight in 7629, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.3125 = fieldNorm(doc=7629)
    
  4. Deerwester, S.; Dumais, S.; Landauer, T.; Furnass, G.; Beck, L.: Improving information retrieval with latent semantic indexing (1988) 3.05
    3.0471804 = sum of:
      3.0471804 = weight(author_txt:landauer in 2396) [ClassicSimilarity], result of:
        3.0471804 = fieldWeight in 2396, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.3125 = fieldNorm(doc=2396)
    
  5. Deerwester, S.C.; Dumais, S.T.; Landauer, T.K.; Furnas, G.W.; Harshman, R.A.: Indexing by latent semantic analysis (1990) 3.05
    3.0471804 = sum of:
      3.0471804 = weight(author_txt:landauer in 2399) [ClassicSimilarity], result of:
        3.0471804 = fieldWeight in 2399, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.3125 = fieldNorm(doc=2399)
    

Similar documents (content)

  1. Jorge-Botana, G.; León, J.A.; Olmos, R.; Hassan-Montero, Y.: Visualizing polysemy using LSA and the predication algorithm (2010) 0.19
    0.19133909 = sum of:
      0.19133909 = product of:
        0.6833539 = sum of:
          0.022943676 = weight(abstract_txt:analysis in 3696) [ClassicSimilarity], result of:
            0.022943676 = score(doc=3696,freq=2.0), product of:
              0.07104827 = queryWeight, product of:
                1.0484383 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.01854796 = queryNorm
              0.3229308 = fieldWeight in 3696, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=3696)
          0.029797928 = weight(abstract_txt:semantic in 3696) [ClassicSimilarity], result of:
            0.029797928 = score(doc=3696,freq=1.0), product of:
              0.10655624 = queryWeight, product of:
                1.283972 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.01854796 = queryNorm
              0.2796451 = fieldWeight in 3696, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=3696)
          0.04859538 = weight(abstract_txt:human in 3696) [ClassicSimilarity], result of:
            0.04859538 = score(doc=3696,freq=2.0), product of:
              0.11717676 = queryWeight, product of:
                1.3464396 = boost
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.01854796 = queryNorm
              0.41471857 = fieldWeight in 3696, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.0625 = fieldNorm(doc=3696)
          0.082270265 = weight(abstract_txt:meaning in 3696) [ClassicSimilarity], result of:
            0.082270265 = score(doc=3696,freq=2.0), product of:
              0.16644603 = queryWeight, product of:
                1.6047332 = boost
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.01854796 = queryNorm
              0.49427593 = fieldWeight in 3696, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.0625 = fieldNorm(doc=3696)
          0.11392781 = weight(abstract_txt:latent in 3696) [ClassicSimilarity], result of:
            0.11392781 = score(doc=3696,freq=1.0), product of:
              0.26054016 = queryWeight, product of:
                2.0077214 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.01854796 = queryNorm
              0.43727544 = fieldWeight in 3696, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0625 = fieldNorm(doc=3696)
          0.10824409 = weight(abstract_txt:words in 3696) [ClassicSimilarity], result of:
            0.10824409 = score(doc=3696,freq=2.0), product of:
              0.22877648 = queryWeight, product of:
                2.3041856 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01854796 = queryNorm
              0.47314343 = fieldWeight in 3696, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=3696)
          0.27757475 = weight(abstract_txt:word in 3696) [ClassicSimilarity], result of:
            0.27757475 = score(doc=3696,freq=3.0), product of:
              0.4717459 = queryWeight, product of:
                4.6792994 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.01854796 = queryNorm
              0.5883988 = fieldWeight in 3696, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=3696)
        0.28 = coord(7/25)
    
  2. Dumais, S.T.: Latent semantic analysis (2003) 0.18
    0.17695284 = sum of:
      0.17695284 = product of:
        0.49153566 = sum of:
          0.010548672 = weight(abstract_txt:knowledge in 2462) [ClassicSimilarity], result of:
            0.010548672 = score(doc=2462,freq=2.0), product of:
              0.067183614 = queryWeight, product of:
                1.0195248 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.01854796 = queryNorm
              0.15701257 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.016223628 = weight(abstract_txt:analysis in 2462) [ClassicSimilarity], result of:
            0.016223628 = score(doc=2462,freq=4.0), product of:
              0.07104827 = queryWeight, product of:
                1.0484383 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.01854796 = queryNorm
              0.22834657 = fieldWeight in 2462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.047479052 = weight(abstract_txt:passages in 2462) [ClassicSimilarity], result of:
            0.047479052 = score(doc=2462,freq=1.0), product of:
              0.18314688 = queryWeight, product of:
                1.1902848 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.01854796 = queryNorm
              0.2592403 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.025805762 = weight(abstract_txt:semantic in 2462) [ClassicSimilarity], result of:
            0.025805762 = score(doc=2462,freq=3.0), product of:
              0.10655624 = queryWeight, product of:
                1.283972 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.01854796 = queryNorm
              0.24217974 = fieldWeight in 2462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.02975847 = weight(abstract_txt:human in 2462) [ClassicSimilarity], result of:
            0.02975847 = score(doc=2462,freq=3.0), product of:
              0.11717676 = queryWeight, product of:
                1.3464396 = boost
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.01854796 = queryNorm
              0.25396222 = fieldWeight in 2462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.033397406 = weight(abstract_txt:does in 2462) [ClassicSimilarity], result of:
            0.033397406 = score(doc=2462,freq=2.0), product of:
              0.14485717 = queryWeight, product of:
                1.4970495 = boost
                5.2168427 = idf(docFreq=651, maxDocs=44218)
                0.01854796 = queryNorm
              0.23055404 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2168427 = idf(docFreq=651, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.056963906 = weight(abstract_txt:latent in 2462) [ClassicSimilarity], result of:
            0.056963906 = score(doc=2462,freq=1.0), product of:
              0.26054016 = queryWeight, product of:
                2.0077214 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.01854796 = queryNorm
              0.21863772 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.1325714 = weight(abstract_txt:words in 2462) [ClassicSimilarity], result of:
            0.1325714 = score(doc=2462,freq=12.0), product of:
              0.22877648 = queryWeight, product of:
                2.3041856 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01854796 = queryNorm
              0.57948 = fieldWeight in 2462, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.13878737 = weight(abstract_txt:word in 2462) [ClassicSimilarity], result of:
            0.13878737 = score(doc=2462,freq=3.0), product of:
              0.4717459 = queryWeight, product of:
                4.6792994 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.01854796 = queryNorm
              0.2941994 = fieldWeight in 2462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
        0.36 = coord(9/25)
    
  3. Leydesdorff, L.; Zhou, P.: Co-word analysis using the Chinese character set (2008) 0.15
    0.1507071 = sum of:
      0.1507071 = product of:
        0.75353545 = sum of:
          0.024335442 = weight(abstract_txt:analysis in 1970) [ClassicSimilarity], result of:
            0.024335442 = score(doc=1970,freq=1.0), product of:
              0.07104827 = queryWeight, product of:
                1.0484383 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.01854796 = queryNorm
              0.34251985 = fieldWeight in 1970, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.09375 = fieldNorm(doc=1970)
          0.06321095 = weight(abstract_txt:semantic in 1970) [ClassicSimilarity], result of:
            0.06321095 = score(doc=1970,freq=2.0), product of:
              0.10655624 = queryWeight, product of:
                1.283972 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.01854796 = queryNorm
              0.5932168 = fieldWeight in 1970, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.09375 = fieldNorm(doc=1970)
          0.0872608 = weight(abstract_txt:meaning in 1970) [ClassicSimilarity], result of:
            0.0872608 = score(doc=1970,freq=1.0), product of:
              0.16644603 = queryWeight, product of:
                1.6047332 = boost
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.01854796 = queryNorm
              0.5242588 = fieldWeight in 1970, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.09375 = fieldNorm(doc=1970)
          0.16236614 = weight(abstract_txt:words in 1970) [ClassicSimilarity], result of:
            0.16236614 = score(doc=1970,freq=2.0), product of:
              0.22877648 = queryWeight, product of:
                2.3041856 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01854796 = queryNorm
              0.7097151 = fieldWeight in 1970, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.09375 = fieldNorm(doc=1970)
          0.4163621 = weight(abstract_txt:word in 1970) [ClassicSimilarity], result of:
            0.4163621 = score(doc=1970,freq=3.0), product of:
              0.4717459 = queryWeight, product of:
                4.6792994 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.01854796 = queryNorm
              0.8825982 = fieldWeight in 1970, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.09375 = fieldNorm(doc=1970)
        0.2 = coord(5/25)
    
  4. Rishel, T.; Perkins, L.A.; Yenduri, S.; Zand, F.: Determining the context of text using augmented latent semantic indexing (2007) 0.14
    0.1363412 = sum of:
      0.1363412 = product of:
        0.681706 = sum of:
          0.04055907 = weight(abstract_txt:analysis in 1316) [ClassicSimilarity], result of:
            0.04055907 = score(doc=1316,freq=4.0), product of:
              0.07104827 = queryWeight, product of:
                1.0484383 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.01854796 = queryNorm
              0.5708664 = fieldWeight in 1316, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.078125 = fieldNorm(doc=1316)
          0.08328774 = weight(abstract_txt:semantic in 1316) [ClassicSimilarity], result of:
            0.08328774 = score(doc=1316,freq=5.0), product of:
              0.10655624 = queryWeight, product of:
                1.283972 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.01854796 = queryNorm
              0.78163177 = fieldWeight in 1316, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.078125 = fieldNorm(doc=1316)
          0.07271733 = weight(abstract_txt:meaning in 1316) [ClassicSimilarity], result of:
            0.07271733 = score(doc=1316,freq=1.0), product of:
              0.16644603 = queryWeight, product of:
                1.6047332 = boost
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.01854796 = queryNorm
              0.43688235 = fieldWeight in 1316, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.078125 = fieldNorm(doc=1316)
          0.28481954 = weight(abstract_txt:latent in 1316) [ClassicSimilarity], result of:
            0.28481954 = score(doc=1316,freq=4.0), product of:
              0.26054016 = queryWeight, product of:
                2.0077214 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.01854796 = queryNorm
              1.0931886 = fieldWeight in 1316, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.078125 = fieldNorm(doc=1316)
          0.20032233 = weight(abstract_txt:word in 1316) [ClassicSimilarity], result of:
            0.20032233 = score(doc=1316,freq=1.0), product of:
              0.4717459 = queryWeight, product of:
                4.6792994 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.01854796 = queryNorm
              0.4246403 = fieldWeight in 1316, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.078125 = fieldNorm(doc=1316)
        0.2 = coord(5/25)
    
  5. Palmquist, R.A.; Sinkankas, G.M.: Client needs without clients : can we understand information needs without clients present to explain them? (1991) 0.13
    0.13217162 = sum of:
      0.13217162 = product of:
        0.66085804 = sum of:
          0.022377111 = weight(abstract_txt:knowledge in 3688) [ClassicSimilarity], result of:
            0.022377111 = score(doc=3688,freq=1.0), product of:
              0.067183614 = queryWeight, product of:
                1.0195248 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.01854796 = queryNorm
              0.33307394 = fieldWeight in 3688, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.09375 = fieldNorm(doc=3688)
          0.034415513 = weight(abstract_txt:analysis in 3688) [ClassicSimilarity], result of:
            0.034415513 = score(doc=3688,freq=2.0), product of:
              0.07104827 = queryWeight, product of:
                1.0484383 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.01854796 = queryNorm
              0.48439622 = fieldWeight in 3688, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.09375 = fieldNorm(doc=3688)
          0.07289307 = weight(abstract_txt:human in 3688) [ClassicSimilarity], result of:
            0.07289307 = score(doc=3688,freq=2.0), product of:
              0.11717676 = queryWeight, product of:
                1.3464396 = boost
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.01854796 = queryNorm
              0.6220778 = fieldWeight in 3688, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.09375 = fieldNorm(doc=3688)
          0.1148102 = weight(abstract_txt:words in 3688) [ClassicSimilarity], result of:
            0.1148102 = score(doc=3688,freq=1.0), product of:
              0.22877648 = queryWeight, product of:
                2.3041856 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01854796 = queryNorm
              0.5018444 = fieldWeight in 3688, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.09375 = fieldNorm(doc=3688)
          0.4163621 = weight(abstract_txt:word in 3688) [ClassicSimilarity], result of:
            0.4163621 = score(doc=3688,freq=3.0), product of:
              0.4717459 = queryWeight, product of:
                4.6792994 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.01854796 = queryNorm
              0.8825982 = fieldWeight in 3688, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.09375 = fieldNorm(doc=3688)
        0.2 = coord(5/25)