Search (4 results, page 1 of 1)

  • × author_ss:"Dumais, S."
  1. Gordon, M.D.; Dumais, S.: Using latent semantic indexing for literature based discovery (1998) 0.02
    0.01617885 = product of:
      0.048536547 = sum of:
        0.048536547 = sum of:
          0.007673528 = weight(_text_:a in 4892) [ClassicSimilarity], result of:
            0.007673528 = score(doc=4892,freq=6.0), product of:
              0.04968032 = queryWeight, product of:
                1.153047 = idf(docFreq=37942, maxDocs=44218)
                0.04308612 = queryNorm
              0.1544581 = fieldWeight in 4892, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                1.153047 = idf(docFreq=37942, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4892)
          0.04086302 = weight(_text_:22 in 4892) [ClassicSimilarity], result of:
            0.04086302 = score(doc=4892,freq=2.0), product of:
              0.15088025 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.04308612 = queryNorm
              0.2708308 = fieldWeight in 4892, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4892)
      0.33333334 = coord(1/3)
    
    Abstract
    Latent semantic indexing (LSI) is a statistical technique for improving information retrieval effectiveness. Here, we use LSI to assist in literature-based discoveries. The idea behind literature-based discoveries is that different authors have already published certain underlying scientific ideas that, when taken together, can be connected to hypothesize a new dicovery, and that these connections can be made by exploring the scientific literature. We explore latent semantic indexing's effectiveness on 2 discovery processes: uncovering 'nearby' relationships that are necessary to initiate the literature based discovery process; and discovering more distant relationships that may genuinely generate new discovery hypotheses
    Date
    11. 2.2016 16:22:19
    Type
    a
  2. Teevan, J.; Dumais, S.: Web retrieval, ranking and personalization (2011) 0.00
    0.0014767711 = product of:
      0.004430313 = sum of:
        0.004430313 = product of:
          0.008860626 = sum of:
            0.008860626 = weight(_text_:a in 551) [ClassicSimilarity], result of:
              0.008860626 = score(doc=551,freq=2.0), product of:
                0.04968032 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.04308612 = queryNorm
                0.17835285 = fieldWeight in 551, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.109375 = fieldNorm(doc=551)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Type
    a
  3. Dumais, S.; Chen, H.: Hierarchical classification of Web content (2000) 0.00
    0.0012658038 = product of:
      0.0037974115 = sum of:
        0.0037974115 = product of:
          0.007594823 = sum of:
            0.007594823 = weight(_text_:a in 492) [ClassicSimilarity], result of:
              0.007594823 = score(doc=492,freq=2.0), product of:
                0.04968032 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.04308612 = queryNorm
                0.15287387 = fieldWeight in 492, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.09375 = fieldNorm(doc=492)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Type
    a
  4. Deerwester, S.; Dumais, S.; Landauer, T.; Furnass, G.; Beck, L.: Improving information retrieval with latent semantic indexing (1988) 0.00
    0.0010962184 = product of:
      0.0032886548 = sum of:
        0.0032886548 = product of:
          0.0065773097 = sum of:
            0.0065773097 = weight(_text_:a in 2396) [ClassicSimilarity], result of:
              0.0065773097 = score(doc=2396,freq=6.0), product of:
                0.04968032 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.04308612 = queryNorm
                0.13239266 = fieldWeight in 2396, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2396)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Describes a latent semantic indexing (LSI) approach for improving information retrieval. Most document retrieval systems depend on matching keywords in queries against those in documents. The LSI approach tries to overcome the incompleteness and imprecision of latent relations among terms and documents. Tested performance of the LSI method ranged from considerably better than to roughly comparable to performance based on weighted keyword matching, apparently depending on the quality of the queries. Best LSI performance was found using a global entropy weighting for terms and about 100 dimensions for representing terms, documents and queries.
    Type
    a