Document (#37692)

Author
Zhu, W.Z.
Allen, R.B.
Title
Document clustering using the LSI subspace signature model
Source
Journal of the American Society for Information Science and Technology. 64(2013) no.4, pp. 844-860
Year
2013
Abstract
We describe the latent semantic indexing subspace signature model (LSISSM) for semantic content representation of unstructured text. Grounded in singular value decomposition, the model represents terms and documents by the distribution signatures of their statistical contribution across the top-ranking latent concept dimensions. LSISSM matches term signatures with document signatures according to their mapping coherence between the latent semantic indexing (LSI) term subspace and the LSI document subspace. LSISSM performs feature reduction and finds a low-rank approximation of scalable and sparse term-document matrices. Experiments demonstrate that this approach significantly improves the performance of major clustering algorithms such as standard K-means and self-organizing maps compared with the vector space model and the traditional LSI model. The unique contribution ranking mechanism in LSISSM also improves the initialization of standard K-means compared with the random seeding procedure, which sometimes causes low efficiency and effectiveness of clustering. A two-stage initialization strategy based on LSISSM significantly reduces the running time of standard K-means procedures.
Theme
Automatic classification
Object
Latent semantic indexing
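
The abstract grounds the model in singular value decomposition of a term-document matrix. As a rough illustration of that underlying step only (not of the LSISSM signature or matching mechanism, which the record does not specify), the following sketch extracts the single strongest latent dimension of a tiny matrix by power iteration. The function name `top_singular_triplet` and the toy matrix are assumptions made for this example.

```python
import math

def top_singular_triplet(A, iters=200):
    """Power iteration on A^T A.

    Returns (sigma, u, v): the largest singular value of A with its
    left (term-side) and right (document-side) singular vectors.
    A is a list of rows; rows are terms, columns are documents.
    """
    n_rows, n_cols = len(A), len(A[0])
    v = [1.0 / math.sqrt(n_cols)] * n_cols  # arbitrary unit start vector
    for _ in range(iters):
        # w = A v
        w = [sum(A[i][j] * v[j] for j in range(n_cols)) for i in range(n_rows)]
        # z = A^T w, then renormalise to unit length
        z = [sum(A[i][j] * w[i] for i in range(n_rows)) for j in range(n_cols)]
        norm = math.sqrt(sum(x * x for x in z))
        v = [x / norm for x in z]
    w = [sum(A[i][j] * v[j] for j in range(n_cols)) for i in range(n_rows)]
    sigma = math.sqrt(sum(x * x for x in w))
    u = [x / sigma for x in w]
    return sigma, u, v

# Toy term-document matrix (hypothetical): terms 0 and 1 co-occur in
# documents 0 and 1; term 2 appears only in document 2.
A = [
    [1.0, 1.0, 0.0],
    [1.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]
sigma, u, v = top_singular_triplet(A)
```

In this toy case the strongest latent dimension groups the two co-occurring terms with the two documents that contain them, and the isolated term and document receive a weight near zero. A full LSI implementation would keep several such dimensions, typically through a library SVD routine rather than hand-written power iteration.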

Similar documents (author)

  1. Allen, B.; Allen, G.: Cognitive abilities of academic librarians and their patrons (1993) 5.38
    5.375742 = sum of:
      5.375742 = weight(author_txt:allen in 6046) [ClassicSimilarity], result of:
        5.375742 = fieldWeight in 6046, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          7.6024475 = idf(docFreq=57, maxDocs=42740)
          0.5 = fieldNorm(doc=6046)
    
  2. Allen, M.M.: Bluetooth bytes information retrieval (2001) 4.75
    4.7515297 = sum of:
      4.7515297 = weight(author_txt:allen in 747) [ClassicSimilarity], result of:
        4.7515297 = fieldWeight in 747, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.6024475 = idf(docFreq=57, maxDocs=42740)
          0.625 = fieldNorm(doc=747)
    
  3. Allen, B.: Topic knowledge and online catalog search formulation (1991) 4.75
    4.7515297 = sum of:
      4.7515297 = weight(author_txt:allen in 1071) [ClassicSimilarity], result of:
        4.7515297 = fieldWeight in 1071, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.6024475 = idf(docFreq=57, maxDocs=42740)
          0.625 = fieldNorm(doc=1071)
    
  4. Allen, L.: Alphabetical subject access, LCSH and a non-traditional approach (1981) 4.75
    4.7515297 = sum of:
      4.7515297 = weight(author_txt:allen in 1571) [ClassicSimilarity], result of:
        4.7515297 = fieldWeight in 1571, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.6024475 = idf(docFreq=57, maxDocs=42740)
          0.625 = fieldNorm(doc=1571)
    
  5. Allen, G.G.: Change in the catalogue in the context of library management (1976) 4.75
    4.7515297 = sum of:
      4.7515297 = weight(author_txt:allen in 1575) [ClassicSimilarity], result of:
        4.7515297 = fieldWeight in 1575, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.6024475 = idf(docFreq=57, maxDocs=42740)
          0.625 = fieldNorm(doc=1575)
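
The author scores above are each a product of three factors: tf, idf and fieldNorm. A minimal sketch of that arithmetic follows, assuming tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); these formulas are inferred from the listed values, which they reproduce, and the function names are chosen for this example. The fieldNorm values are taken from the listing as given.

```python
import math

def classic_tf(freq):
    # Term-frequency factor: square root of the raw frequency.
    return math.sqrt(freq)

def classic_idf(doc_freq, max_docs):
    # Inverse document frequency in the form that matches the listed values.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as in the explain trees above.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Entry 1: "allen" occurs twice in the author field of doc 6046.
score_doc_6046 = field_weight(freq=2.0, doc_freq=57, max_docs=42740, field_norm=0.5)

# Entries 2 to 5: "allen" occurs once, with fieldNorm 0.625.
score_single = field_weight(freq=1.0, doc_freq=57, max_docs=42740, field_norm=0.625)
```

The results agree with the listed 5.375742 and 4.7515297 up to single-precision rounding. Entry 1 ranks first because the term matches two authors, even though its fieldNorm is lower.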
    

Similar documents (content)

  1. Berry, M.W.; Dumais, S.T.; O'Brien, G.W.: Using linear algebra for intelligent information retrieval (1995) 0.45
    0.4457574 = sum of:
      0.4457574 = product of:
        1.1143935 = sum of:
          0.06363542 = weight(abstract_txt:decomposition in 4207) [ClassicSimilarity], result of:
            0.06363542 = score(doc=4207,freq=1.0), product of:
              0.103083424 = queryWeight, product of:
                1.0369891 = boost
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.012580405 = queryNorm
              0.6173196 = fieldWeight in 4207, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.078125 = fieldNorm(doc=4207)
          0.06666893 = weight(abstract_txt:matrices in 4207) [ClassicSimilarity], result of:
            0.06666893 = score(doc=4207,freq=1.0), product of:
              0.106333934 = queryWeight, product of:
                1.0532118 = boost
                8.025305 = idf(docFreq=37, maxDocs=42740)
                0.012580405 = queryNorm
              0.62697697 = fieldWeight in 4207, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.025305 = idf(docFreq=37, maxDocs=42740)
                0.078125 = fieldNorm(doc=4207)
          0.10830164 = weight(abstract_txt:singular in 4207) [ClassicSimilarity], result of:
            0.10830164 = score(doc=4207,freq=2.0), product of:
              0.11662803 = queryWeight, product of:
                1.1030146 = boost
                8.404794 = idf(docFreq=25, maxDocs=42740)
                0.012580405 = queryNorm
              0.9286073 = fieldWeight in 4207, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.404794 = idf(docFreq=25, maxDocs=42740)
                0.078125 = fieldNorm(doc=4207)
          0.078789674 = weight(abstract_txt:sparse in 4207) [ClassicSimilarity], result of:
            0.078789674 = score(doc=4207,freq=1.0), product of:
              0.118860014 = queryWeight, product of:
                1.1135191 = boost
                8.484837 = idf(docFreq=23, maxDocs=42740)
                0.012580405 = queryNorm
              0.66287786 = fieldWeight in 4207, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.484837 = idf(docFreq=23, maxDocs=42740)
                0.078125 = fieldNorm(doc=4207)
          0.029830955 = weight(abstract_txt:indexing in 4207) [ClassicSimilarity], result of:
            0.029830955 = score(doc=4207,freq=2.0), product of:
              0.062206294 = queryWeight, product of:
                1.1392314 = boost
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.012580405 = queryNorm
              0.4795488 = fieldWeight in 4207, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.078125 = fieldNorm(doc=4207)
          0.035218254 = weight(abstract_txt:semantic in 4207) [ClassicSimilarity], result of:
            0.035218254 = score(doc=4207,freq=1.0), product of:
              0.10021711 = queryWeight, product of:
                1.7709706 = boost
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.012580405 = queryNorm
              0.35141957 = fieldWeight in 4207, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.078125 = fieldNorm(doc=4207)
          0.043568518 = weight(abstract_txt:term in 4207) [ClassicSimilarity], result of:
            0.043568518 = score(doc=4207,freq=1.0), product of:
              0.11549022 = queryWeight, product of:
                1.9011353 = boost
                4.8287816 = idf(docFreq=928, maxDocs=42740)
                0.012580405 = queryNorm
              0.37724856 = fieldWeight in 4207, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8287816 = idf(docFreq=928, maxDocs=42740)
                0.078125 = fieldNorm(doc=4207)
          0.057237193 = weight(abstract_txt:document in 4207) [ClassicSimilarity], result of:
            0.057237193 = score(doc=4207,freq=2.0), product of:
              0.121018514 = queryWeight, product of:
                2.2471688 = boost
                4.280766 = idf(docFreq=1606, maxDocs=42740)
                0.012580405 = queryNorm
              0.4729623 = fieldWeight in 4207, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.280766 = idf(docFreq=1606, maxDocs=42740)
                0.078125 = fieldNorm(doc=4207)
          0.134894 = weight(abstract_txt:latent in 4207) [ClassicSimilarity], result of:
            0.134894 = score(doc=4207,freq=1.0), product of:
              0.24533439 = queryWeight, product of:
                2.7708921 = boost
                7.0379176 = idf(docFreq=101, maxDocs=42740)
                0.012580405 = queryNorm
              0.5498373 = fieldWeight in 4207, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0379176 = idf(docFreq=101, maxDocs=42740)
                0.078125 = fieldNorm(doc=4207)
          0.4962489 = weight(abstract_txt:subspace in 4207) [ClassicSimilarity], result of:
            0.4962489 = score(doc=4207,freq=1.0), product of:
              0.6434912 = queryWeight, product of:
                5.1818056 = boost
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.012580405 = queryNorm
              0.7711821 = fieldWeight in 4207, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.078125 = fieldNorm(doc=4207)
        0.4 = coord(10/25)
    
  2. Li, D.; Kwong, C.-P.; Lee, D.L.: Unified linear subspace approach to semantic analysis (2009) 0.24
    0.24255481 = sum of:
      0.24255481 = product of:
        0.606387 = sum of:
          0.05090833 = weight(abstract_txt:decomposition in 322) [ClassicSimilarity], result of:
            0.05090833 = score(doc=322,freq=1.0), product of:
              0.103083424 = queryWeight, product of:
                1.0369891 = boost
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.012580405 = queryNorm
              0.49385566 = fieldWeight in 322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.0625 = fieldNorm(doc=322)
          0.061264656 = weight(abstract_txt:singular in 322) [ClassicSimilarity], result of:
            0.061264656 = score(doc=322,freq=1.0), product of:
              0.11662803 = queryWeight, product of:
                1.1030146 = boost
                8.404794 = idf(docFreq=25, maxDocs=42740)
                0.012580405 = queryNorm
              0.5252996 = fieldWeight in 322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.404794 = idf(docFreq=25, maxDocs=42740)
                0.0625 = fieldNorm(doc=322)
          0.016874935 = weight(abstract_txt:indexing in 322) [ClassicSimilarity], result of:
            0.016874935 = score(doc=322,freq=1.0), product of:
              0.062206294 = queryWeight, product of:
                1.1392314 = boost
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.012580405 = queryNorm
              0.27127376 = fieldWeight in 322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.0625 = fieldNorm(doc=322)
          0.03518154 = weight(abstract_txt:significantly in 322) [ClassicSimilarity], result of:
            0.03518154 = score(doc=322,freq=1.0), product of:
              0.10151951 = queryWeight, product of:
                1.455357 = boost
                5.544793 = idf(docFreq=453, maxDocs=42740)
                0.012580405 = queryNorm
              0.34654957 = fieldWeight in 322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.544793 = idf(docFreq=453, maxDocs=42740)
                0.0625 = fieldNorm(doc=322)
          0.0690134 = weight(abstract_txt:semantic in 322) [ClassicSimilarity], result of:
            0.0690134 = score(doc=322,freq=6.0), product of:
              0.10021711 = queryWeight, product of:
                1.7709706 = boost
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.012580405 = queryNorm
              0.6886389 = fieldWeight in 322, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.0625 = fieldNorm(doc=322)
          0.03258708 = weight(abstract_txt:standard in 322) [ClassicSimilarity], result of:
            0.03258708 = score(doc=322,freq=1.0), product of:
              0.11042489 = queryWeight, product of:
                1.8589765 = boost
                4.7217007 = idf(docFreq=1033, maxDocs=42740)
                0.012580405 = queryNorm
              0.2951063 = fieldWeight in 322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7217007 = idf(docFreq=1033, maxDocs=42740)
                0.0625 = fieldNorm(doc=322)
          0.06037031 = weight(abstract_txt:term in 322) [ClassicSimilarity], result of:
            0.06037031 = score(doc=322,freq=3.0), product of:
              0.11549022 = queryWeight, product of:
                1.9011353 = boost
                4.8287816 = idf(docFreq=928, maxDocs=42740)
                0.012580405 = queryNorm
              0.52273095 = fieldWeight in 322, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.8287816 = idf(docFreq=928, maxDocs=42740)
                0.0625 = fieldNorm(doc=322)
          0.045789756 = weight(abstract_txt:document in 322) [ClassicSimilarity], result of:
            0.045789756 = score(doc=322,freq=2.0), product of:
              0.121018514 = queryWeight, product of:
                2.2471688 = boost
                4.280766 = idf(docFreq=1606, maxDocs=42740)
                0.012580405 = queryNorm
              0.37836984 = fieldWeight in 322, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.280766 = idf(docFreq=1606, maxDocs=42740)
                0.0625 = fieldNorm(doc=322)
          0.04748243 = weight(abstract_txt:model in 322) [ClassicSimilarity], result of:
            0.04748243 = score(doc=322,freq=2.0), product of:
              0.13355646 = queryWeight, product of:
                2.6393516 = boost
                4.022287 = idf(docFreq=2080, maxDocs=42740)
                0.012580405 = queryNorm
              0.3555233 = fieldWeight in 322, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.022287 = idf(docFreq=2080, maxDocs=42740)
                0.0625 = fieldNorm(doc=322)
          0.18691461 = weight(abstract_txt:latent in 322) [ClassicSimilarity], result of:
            0.18691461 = score(doc=322,freq=3.0), product of:
              0.24533439 = queryWeight, product of:
                2.7708921 = boost
                7.0379176 = idf(docFreq=101, maxDocs=42740)
                0.012580405 = queryNorm
              0.76187694 = fieldWeight in 322, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.0379176 = idf(docFreq=101, maxDocs=42740)
                0.0625 = fieldNorm(doc=322)
        0.4 = coord(10/25)
    
  3. Ding, C.H.Q.: A probabilistic model for Latent Semantic Indexing (2005) 0.19
    0.18557149 = sum of:
      0.18557149 = product of:
        0.57991093 = sum of:
          0.021093668 = weight(abstract_txt:indexing in 4460) [ClassicSimilarity], result of:
            0.021093668 = score(doc=4460,freq=1.0), product of:
              0.062206294 = queryWeight, product of:
                1.1392314 = boost
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.012580405 = queryNorm
              0.3390922 = fieldWeight in 4460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.078125 = fieldNorm(doc=4460)
          0.051299535 = weight(abstract_txt:contribution in 4460) [ClassicSimilarity], result of:
            0.051299535 = score(doc=4460,freq=1.0), product of:
              0.11249724 = queryWeight, product of:
                1.5320245 = boost
                5.83689 = idf(docFreq=338, maxDocs=42740)
                0.012580405 = queryNorm
              0.45600706 = fieldWeight in 4460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.83689 = idf(docFreq=338, maxDocs=42740)
                0.078125 = fieldNorm(doc=4460)
          0.07875042 = weight(abstract_txt:semantic in 4460) [ClassicSimilarity], result of:
            0.07875042 = score(doc=4460,freq=5.0), product of:
              0.10021711 = queryWeight, product of:
                1.7709706 = boost
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.012580405 = queryNorm
              0.7857981 = fieldWeight in 4460, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.078125 = fieldNorm(doc=4460)
          0.08067434 = weight(abstract_txt:improves in 4460) [ClassicSimilarity], result of:
            0.08067434 = score(doc=4460,freq=1.0), product of:
              0.15213293 = queryWeight, product of:
                1.7815844 = boost
                6.787693 = idf(docFreq=130, maxDocs=42740)
                0.012580405 = queryNorm
              0.5302885 = fieldWeight in 4460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.787693 = idf(docFreq=130, maxDocs=42740)
                0.078125 = fieldNorm(doc=4460)
          0.04073385 = weight(abstract_txt:standard in 4460) [ClassicSimilarity], result of:
            0.04073385 = score(doc=4460,freq=1.0), product of:
              0.11042489 = queryWeight, product of:
                1.8589765 = boost
                4.7217007 = idf(docFreq=1033, maxDocs=42740)
                0.012580405 = queryNorm
              0.36888286 = fieldWeight in 4460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7217007 = idf(docFreq=1033, maxDocs=42740)
                0.078125 = fieldNorm(doc=4460)
          0.057237193 = weight(abstract_txt:document in 4460) [ClassicSimilarity], result of:
            0.057237193 = score(doc=4460,freq=2.0), product of:
              0.121018514 = queryWeight, product of:
                2.2471688 = boost
                4.280766 = idf(docFreq=1606, maxDocs=42740)
                0.012580405 = queryNorm
              0.4729623 = fieldWeight in 4460, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.280766 = idf(docFreq=1606, maxDocs=42740)
                0.078125 = fieldNorm(doc=4460)
          0.05935304 = weight(abstract_txt:model in 4460) [ClassicSimilarity], result of:
            0.05935304 = score(doc=4460,freq=2.0), product of:
              0.13355646 = queryWeight, product of:
                2.6393516 = boost
                4.022287 = idf(docFreq=2080, maxDocs=42740)
                0.012580405 = queryNorm
              0.44440413 = fieldWeight in 4460, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.022287 = idf(docFreq=2080, maxDocs=42740)
                0.078125 = fieldNorm(doc=4460)
          0.19076891 = weight(abstract_txt:latent in 4460) [ClassicSimilarity], result of:
            0.19076891 = score(doc=4460,freq=2.0), product of:
              0.24533439 = queryWeight, product of:
                2.7708921 = boost
                7.0379176 = idf(docFreq=101, maxDocs=42740)
                0.012580405 = queryNorm
              0.77758735 = fieldWeight in 4460, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0379176 = idf(docFreq=101, maxDocs=42740)
                0.078125 = fieldNorm(doc=4460)
        0.32 = coord(8/25)
    
  4. Dunlavy, D.M.; O'Leary, D.P.; Conroy, J.M.; Schlesinger, J.D.: QCS: A system for querying, clustering and summarizing documents (2007) 0.18
    0.18366538 = sum of:
      0.18366538 = product of:
        0.45916343 = sum of:
          0.04454479 = weight(abstract_txt:decomposition in 2948) [ClassicSimilarity], result of:
            0.04454479 = score(doc=2948,freq=1.0), product of:
              0.103083424 = queryWeight, product of:
                1.0369891 = boost
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.012580405 = queryNorm
              0.4321237 = fieldWeight in 2948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2948)
          0.014765569 = weight(abstract_txt:indexing in 2948) [ClassicSimilarity], result of:
            0.014765569 = score(doc=2948,freq=1.0), product of:
              0.062206294 = queryWeight, product of:
                1.1392314 = boost
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.012580405 = queryNorm
              0.23736455 = fieldWeight in 2948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2948)
          0.024652777 = weight(abstract_txt:semantic in 2948) [ClassicSimilarity], result of:
            0.024652777 = score(doc=2948,freq=1.0), product of:
              0.10021711 = queryWeight, product of:
                1.7709706 = boost
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.012580405 = queryNorm
              0.24599369 = fieldWeight in 2948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2948)
          0.05647204 = weight(abstract_txt:improves in 2948) [ClassicSimilarity], result of:
            0.05647204 = score(doc=2948,freq=1.0), product of:
              0.15213293 = queryWeight, product of:
                1.7815844 = boost
                6.787693 = idf(docFreq=130, maxDocs=42740)
                0.012580405 = queryNorm
              0.37120196 = fieldWeight in 2948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.787693 = idf(docFreq=130, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2948)
          0.028513694 = weight(abstract_txt:standard in 2948) [ClassicSimilarity], result of:
            0.028513694 = score(doc=2948,freq=1.0), product of:
              0.11042489 = queryWeight, product of:
                1.8589765 = boost
                4.7217007 = idf(docFreq=1033, maxDocs=42740)
                0.012580405 = queryNorm
              0.258218 = fieldWeight in 2948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7217007 = idf(docFreq=1033, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2948)
          0.034141824 = weight(abstract_txt:means in 2948) [ClassicSimilarity], result of:
            0.034141824 = score(doc=2948,freq=1.0), product of:
              0.12451522 = queryWeight, product of:
                1.9740204 = boost
                5.013906 = idf(docFreq=771, maxDocs=42740)
                0.012580405 = queryNorm
              0.274198 = fieldWeight in 2948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.013906 = idf(docFreq=771, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2948)
          0.040066037 = weight(abstract_txt:document in 2948) [ClassicSimilarity], result of:
            0.040066037 = score(doc=2948,freq=2.0), product of:
              0.121018514 = queryWeight, product of:
                2.2471688 = boost
                4.280766 = idf(docFreq=1606, maxDocs=42740)
                0.012580405 = queryNorm
              0.3310736 = fieldWeight in 2948, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.280766 = idf(docFreq=1606, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2948)
          0.09220264 = weight(abstract_txt:clustering in 2948) [ClassicSimilarity], result of:
            0.09220264 = score(doc=2948,freq=2.0), product of:
              0.19165356 = queryWeight, product of:
                2.4490569 = boost
                6.220473 = idf(docFreq=230, maxDocs=42740)
                0.012580405 = queryNorm
              0.48109016 = fieldWeight in 2948, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.220473 = idf(docFreq=230, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2948)
          0.029378254 = weight(abstract_txt:model in 2948) [ClassicSimilarity], result of:
            0.029378254 = score(doc=2948,freq=1.0), product of:
              0.13355646 = queryWeight, product of:
                2.6393516 = boost
                4.022287 = idf(docFreq=2080, maxDocs=42740)
                0.012580405 = queryNorm
              0.21996881 = fieldWeight in 2948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.022287 = idf(docFreq=2080, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2948)
          0.0944258 = weight(abstract_txt:latent in 2948) [ClassicSimilarity], result of:
            0.0944258 = score(doc=2948,freq=1.0), product of:
              0.24533439 = queryWeight, product of:
                2.7708921 = boost
                7.0379176 = idf(docFreq=101, maxDocs=42740)
                0.012580405 = queryNorm
              0.38488612 = fieldWeight in 2948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0379176 = idf(docFreq=101, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2948)
        0.4 = coord(10/25)
    
  5. Chen, L.; Zeng, J.; Tokuda, N.: A "stereo" document representation for textual information retrieval (2006) 0.18
    0.18326618 = sum of:
      0.18326618 = product of:
        0.5727068 = sum of:
          0.021093668 = weight(abstract_txt:indexing in 293) [ClassicSimilarity], result of:
            0.021093668 = score(doc=293,freq=1.0), product of:
              0.062206294 = queryWeight, product of:
                1.1392314 = boost
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.012580405 = queryNorm
              0.3390922 = fieldWeight in 293, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.078125 = fieldNorm(doc=293)
          0.035218254 = weight(abstract_txt:semantic in 293) [ClassicSimilarity], result of:
            0.035218254 = score(doc=293,freq=1.0), product of:
              0.10021711 = queryWeight, product of:
                1.7709706 = boost
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.012580405 = queryNorm
              0.35141957 = fieldWeight in 293, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.078125 = fieldNorm(doc=293)
          0.08067434 = weight(abstract_txt:improves in 293) [ClassicSimilarity], result of:
            0.08067434 = score(doc=293,freq=1.0), product of:
              0.15213293 = queryWeight, product of:
                1.7815844 = boost
                6.787693 = idf(docFreq=130, maxDocs=42740)
                0.012580405 = queryNorm
              0.5302885 = fieldWeight in 293, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.787693 = idf(docFreq=130, maxDocs=42740)
                0.078125 = fieldNorm(doc=293)
          0.057606366 = weight(abstract_txt:standard in 293) [ClassicSimilarity], result of:
            0.057606366 = score(doc=293,freq=2.0), product of:
              0.11042489 = queryWeight, product of:
                1.8589765 = boost
                4.7217007 = idf(docFreq=1033, maxDocs=42740)
                0.012580405 = queryNorm
              0.52167916 = fieldWeight in 293, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7217007 = idf(docFreq=1033, maxDocs=42740)
                0.078125 = fieldNorm(doc=293)
          0.043568518 = weight(abstract_txt:term in 293) [ClassicSimilarity], result of:
            0.043568518 = score(doc=293,freq=1.0), product of:
              0.11549022 = queryWeight, product of:
                1.9011353 = boost
                4.8287816 = idf(docFreq=928, maxDocs=42740)
                0.012580405 = queryNorm
              0.37724856 = fieldWeight in 293, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8287816 = idf(docFreq=928, maxDocs=42740)
                0.078125 = fieldNorm(doc=293)
          0.08094561 = weight(abstract_txt:document in 293) [ClassicSimilarity], result of:
            0.08094561 = score(doc=293,freq=4.0), product of:
              0.121018514 = queryWeight, product of:
                2.2471688 = boost
                4.280766 = idf(docFreq=1606, maxDocs=42740)
                0.012580405 = queryNorm
              0.6688697 = fieldWeight in 293, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.280766 = idf(docFreq=1606, maxDocs=42740)
                0.078125 = fieldNorm(doc=293)
          0.11870608 = weight(abstract_txt:model in 293) [ClassicSimilarity], result of:
            0.11870608 = score(doc=293,freq=8.0), product of:
              0.13355646 = queryWeight, product of:
                2.6393516 = boost
                4.022287 = idf(docFreq=2080, maxDocs=42740)
                0.012580405 = queryNorm
              0.88880825 = fieldWeight in 293, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.022287 = idf(docFreq=2080, maxDocs=42740)
                0.078125 = fieldNorm(doc=293)
          0.134894 = weight(abstract_txt:latent in 293) [ClassicSimilarity], result of:
            0.134894 = score(doc=293,freq=1.0), product of:
              0.24533439 = queryWeight, product of:
                2.7708921 = boost
                7.0379176 = idf(docFreq=101, maxDocs=42740)
                0.012580405 = queryNorm
              0.5498373 = fieldWeight in 293, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0379176 = idf(docFreq=101, maxDocs=42740)
                0.078125 = fieldNorm(doc=293)
        0.32 = coord(8/25)
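
The content scores above combine a per-term product (queryWeight times fieldWeight) with a coordination factor for the share of query terms matched. The sketch below retraces that arithmetic for the first entry (doc 4207), assuming the same tf and idf forms that reproduce the listed values; the function names are illustrative, and the boost, queryNorm and fieldNorm inputs are copied from the listing rather than derived.

```python
import math

MAX_DOCS = 42740
QUERY_NORM = 0.012580405  # copied from the explain output

def idf(doc_freq, max_docs=MAX_DOCS):
    # Form that matches the idf values listed above.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, boost, field_norm, query_norm=QUERY_NORM):
    # queryWeight = boost * idf * queryNorm
    # fieldWeight = sqrt(freq) * idf * fieldNorm
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight

def document_score(term_scores, matched, total):
    # Sum of the matching term scores, scaled by coord(matched/total).
    return sum(term_scores) * (matched / total)

# "subspace" in doc 4207: freq=1, docFreq=5, boost=5.1818056, fieldNorm=0.078125
subspace_score = term_score(1.0, 5, 5.1818056, 0.078125)

# The ten per-term scores listed for doc 4207, in order of appearance.
doc_4207_terms = [
    0.06363542, 0.06666893, 0.10830164, 0.078789674, 0.029830955,
    0.035218254, 0.043568518, 0.057237193, 0.134894, 0.4962489,
]
doc_4207_score = document_score(doc_4207_terms, matched=10, total=25)
```

Because idf enters both queryWeight and fieldWeight, rare terms are rewarded quadratically. That is why "subspace" (docFreq=5) alone contributes about 0.496 of the 1.114 summed for doc 4207 before the coord factor of 0.4 is applied.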