Document (#34122)

Author
Wang, W.M.
Cheung, C.F.
Lee, W.B.
Kwok, S.K.
Title
Mining knowledge from natural language texts using fuzzy associated concept mapping
Source
Information processing and management. 44(2008) no.5, S.1707-1719
Year
2008
Abstract
Natural Language Processing (NLP) techniques have been successfully used to automatically extract information from unstructured text through a detailed analysis of their content, often to satisfy particular information needs. In this paper, an automatic concept map construction technique, Fuzzy Association Concept Mapping (FACM), is proposed for the conversion of abstracted short texts into concept maps. The approach consists of a linguistic module and a recommendation module. The linguistic module is a text mining method that does not require the use to have any prior knowledge about using NLP techniques. It incorporates rule-based reasoning (RBR) and case based reasoning (CBR) for anaphoric resolution. It aims at extracting the propositions in text so as to construct a concept map automatically. The recommendation module is arrived at by adopting fuzzy set theories. It is an interactive process which provides suggestions of propositions for further human refinement of the automatically generated concept maps. The suggested propositions are relationships among the concepts which are not explicitly found in the paragraphs. This technique helps to stimulate individual reflection and generate new knowledge. Evaluation was carried out by using the Science Citation Index (SCI) abstract database and CNET News as test data, which are well known databases and the quality of the text is assured. Experimental results show that the automatically generated concept maps conform to the outputs generated manually by domain experts, since the degree of difference between them is proportionally small. The method provides users with the ability to convert scientific and short texts into a structured format which can be easily processed by computer. Moreover, it provides knowledge workers with extra time to re-think their written text and to view their knowledge from another angle.
Theme
Data Mining

Similar documents (author)

  1. Tsui, E.; Wang, W.M.; Cheung, C.F.; Lau, A.S.M.: ¬A concept-relationship acquisition and inference approach for hierarchical taxonomy construction from tags (2010) 1.78
    1.7760166 = sum of:
      1.7760166 = product of:
        2.6640248 = sum of:
          0.7070545 = weight(author_txt:wang in 4220) [ClassicSimilarity], result of:
            0.7070545 = score(doc=4220,freq=1.0), product of:
              0.3448474 = queryWeight, product of:
                6.5610886 = idf(docFreq=169, maxDocs=44218)
                0.05255948 = queryNorm
              2.0503402 = fieldWeight in 4220, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5610886 = idf(docFreq=169, maxDocs=44218)
                0.3125 = fieldNorm(doc=4220)
          1.9569703 = weight(author_txt:cheung in 4220) [ClassicSimilarity], result of:
            1.9569703 = score(doc=4220,freq=1.0), product of:
              0.67980003 = queryWeight, product of:
                1.4040323 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.05255948 = queryNorm
              2.8787441 = fieldWeight in 4220, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.3125 = fieldNorm(doc=4220)
        0.6666667 = coord(2/3)
    
  2. Kwok, K.L.: ¬The use of titles and cited titles as document representations for automatic classification (1975) 1.21
    1.212117 = sum of:
      1.212117 = product of:
        3.6363506 = sum of:
          3.6363506 = weight(author_txt:kwok in 4347) [ClassicSimilarity], result of:
            3.6363506 = score(doc=4347,freq=1.0), product of:
              0.64726514 = queryWeight, product of:
                1.3700223 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.05255948 = queryNorm
              5.6180234 = fieldWeight in 4347, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.625 = fieldNorm(doc=4347)
        0.33333334 = coord(1/3)
    
  3. Kwok, K.L.: Employing multiple representations for Chinese information retrieval (1999) 1.21
    1.212117 = sum of:
      1.212117 = product of:
        3.6363506 = sum of:
          3.6363506 = weight(author_txt:kwok in 3773) [ClassicSimilarity], result of:
            3.6363506 = score(doc=3773,freq=1.0), product of:
              0.64726514 = queryWeight, product of:
                1.3700223 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.05255948 = queryNorm
              5.6180234 = fieldWeight in 3773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.625 = fieldNorm(doc=3773)
        0.33333334 = coord(1/3)
    
  4. Kwok, K.L.: ¬A network approach to probabilistic information retrieval (1995) 1.21
    1.212117 = sum of:
      1.212117 = product of:
        3.6363506 = sum of:
          3.6363506 = weight(author_txt:kwok in 5696) [ClassicSimilarity], result of:
            3.6363506 = score(doc=5696,freq=1.0), product of:
              0.64726514 = queryWeight, product of:
                1.3700223 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.05255948 = queryNorm
              5.6180234 = fieldWeight in 5696, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.625 = fieldNorm(doc=5696)
        0.33333334 = coord(1/3)
    
  5. Kwok, K.L.: Improving English and Chinese ad-hoc retrieval : a TIPSTER text phase 3 project report (2000) 1.21
    1.212117 = sum of:
      1.212117 = product of:
        3.6363506 = sum of:
          3.6363506 = weight(author_txt:kwok in 6388) [ClassicSimilarity], result of:
            3.6363506 = score(doc=6388,freq=1.0), product of:
              0.64726514 = queryWeight, product of:
                1.3700223 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.05255948 = queryNorm
              5.6180234 = fieldWeight in 6388, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.625 = fieldNorm(doc=6388)
        0.33333334 = coord(1/3)
    

Similar documents (content)

  1. Kim, J.-M.; Shin, H.; Kim, H.-J.: Schema and constraints-based matching and merging of Topic Maps (2007) 0.27
    0.27026436 = sum of:
      0.27026436 = product of:
        0.8445761 = sum of:
          0.02350651 = weight(abstract_txt:techniques in 922) [ClassicSimilarity], result of:
            0.02350651 = score(doc=922,freq=1.0), product of:
              0.08302796 = queryWeight, product of:
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.018329076 = queryNorm
              0.2831156 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0625 = fieldNorm(doc=922)
          0.015755426 = weight(abstract_txt:using in 922) [ClassicSimilarity], result of:
            0.015755426 = score(doc=922,freq=1.0), product of:
              0.07279185 = queryWeight, product of:
                1.1467661 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.018329076 = queryNorm
              0.21644491 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=922)
          0.070678614 = weight(abstract_txt:linguistic in 922) [ClassicSimilarity], result of:
            0.070678614 = score(doc=922,freq=2.0), product of:
              0.13728222 = queryWeight, product of:
                1.2858638 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.018329076 = queryNorm
              0.51484174 = fieldWeight in 922, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=922)
          0.012550143 = weight(abstract_txt:which in 922) [ClassicSimilarity], result of:
            0.012550143 = score(doc=922,freq=1.0), product of:
              0.06884536 = queryWeight, product of:
                1.2877755 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.018329076 = queryNorm
              0.18229467 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.0625 = fieldNorm(doc=922)
          0.09028198 = weight(abstract_txt:generated in 922) [ClassicSimilarity], result of:
            0.09028198 = score(doc=922,freq=2.0), product of:
              0.1850064 = queryWeight, product of:
                1.8282131 = boost
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.018329076 = queryNorm
              0.4879938 = fieldWeight in 922, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.0625 = fieldNorm(doc=922)
          0.17321333 = weight(abstract_txt:maps in 922) [ClassicSimilarity], result of:
            0.17321333 = score(doc=922,freq=5.0), product of:
              0.21047163 = queryWeight, product of:
                1.9499804 = boost
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.018329076 = queryNorm
              0.8229771 = fieldWeight in 922, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.0625 = fieldNorm(doc=922)
          0.08492688 = weight(abstract_txt:automatically in 922) [ClassicSimilarity], result of:
            0.08492688 = score(doc=922,freq=1.0), product of:
              0.24630451 = queryWeight, product of:
                2.4357853 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.018329076 = queryNorm
              0.3448044 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0625 = fieldNorm(doc=922)
          0.37366325 = weight(abstract_txt:module in 922) [ClassicSimilarity], result of:
            0.37366325 = score(doc=922,freq=4.0), product of:
              0.4166223 = queryWeight, product of:
                3.1679192 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.018329076 = queryNorm
              0.8968873 = fieldWeight in 922, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.0625 = fieldNorm(doc=922)
        0.32 = coord(8/25)
    
  2. Rindflesch, T.C.; Fizsman, M.: The interaction of domain knowledge and linguistic structure in natural language processing : interpreting hypernymic propositions in biomedical text (2003) 0.24
    0.24100988 = sum of:
      0.24100988 = product of:
        0.7531559 = sum of:
          0.033143297 = weight(abstract_txt:natural in 2097) [ClassicSimilarity], result of:
            0.033143297 = score(doc=2097,freq=1.0), product of:
              0.104398936 = queryWeight, product of:
                1.1213362 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.018329076 = queryNorm
              0.31746778 = fieldWeight in 2097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.070678614 = weight(abstract_txt:linguistic in 2097) [ClassicSimilarity], result of:
            0.070678614 = score(doc=2097,freq=2.0), product of:
              0.13728222 = queryWeight, product of:
                1.2858638 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.018329076 = queryNorm
              0.51484174 = fieldWeight in 2097, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.025100285 = weight(abstract_txt:which in 2097) [ClassicSimilarity], result of:
            0.025100285 = score(doc=2097,freq=4.0), product of:
              0.06884536 = queryWeight, product of:
                1.2877755 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.018329076 = queryNorm
              0.36458933 = fieldWeight in 2097, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.028424805 = weight(abstract_txt:provides in 2097) [ClassicSimilarity], result of:
            0.028424805 = score(doc=2097,freq=1.0), product of:
              0.1078767 = queryWeight, product of:
                1.3960382 = boost
                4.215895 = idf(docFreq=1773, maxDocs=44218)
                0.018329076 = queryNorm
              0.26349345 = fieldWeight in 2097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.215895 = idf(docFreq=1773, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.049107246 = weight(abstract_txt:knowledge in 2097) [ClassicSimilarity], result of:
            0.049107246 = score(doc=2097,freq=3.0), product of:
              0.12768373 = queryWeight, product of:
                1.9607652 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.018329076 = queryNorm
              0.38460067 = fieldWeight in 2097, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.05912652 = weight(abstract_txt:text in 2097) [ClassicSimilarity], result of:
            0.05912652 = score(doc=2097,freq=2.0), product of:
              0.16542093 = queryWeight, product of:
                2.2317886 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.018329076 = queryNorm
              0.3574307 = fieldWeight in 2097, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.3473641 = weight(abstract_txt:propositions in 2097) [ClassicSimilarity], result of:
            0.3473641 = score(doc=2097,freq=3.0), product of:
              0.39683706 = queryWeight, product of:
                2.6775622 = boost
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.018329076 = queryNorm
              0.8753318 = fieldWeight in 2097, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.14021106 = weight(abstract_txt:concept in 2097) [ClassicSimilarity], result of:
            0.14021106 = score(doc=2097,freq=3.0), product of:
              0.28747675 = queryWeight, product of:
                3.4811537 = boost
                4.505458 = idf(docFreq=1327, maxDocs=44218)
                0.018329076 = queryNorm
              0.48773012 = fieldWeight in 2097, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.505458 = idf(docFreq=1327, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
        0.32 = coord(8/25)
    
  3. Chen, R.-S.; Hu, Y.-C.: ¬A novel method for discovering Fuzzy sequential patterns using the simple Fuzzy partition method (2003) 0.23
    0.22643325 = sum of:
      0.22643325 = product of:
        0.70760393 = sum of:
          0.0468717 = weight(abstract_txt:natural in 1614) [ClassicSimilarity], result of:
            0.0468717 = score(doc=1614,freq=2.0), product of:
              0.104398936 = queryWeight, product of:
                1.1213362 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.018329076 = queryNorm
              0.44896722 = fieldWeight in 1614, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.0625 = fieldNorm(doc=1614)
          0.015755426 = weight(abstract_txt:using in 1614) [ClassicSimilarity], result of:
            0.015755426 = score(doc=1614,freq=1.0), product of:
              0.07279185 = queryWeight, product of:
                1.1467661 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.018329076 = queryNorm
              0.21644491 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=1614)
          0.04417135 = weight(abstract_txt:technique in 1614) [ClassicSimilarity], result of:
            0.04417135 = score(doc=1614,freq=1.0), product of:
              0.12643269 = queryWeight, product of:
                1.2340066 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.018329076 = queryNorm
              0.34936652 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.0625 = fieldNorm(doc=1614)
          0.049977332 = weight(abstract_txt:linguistic in 1614) [ClassicSimilarity], result of:
            0.049977332 = score(doc=1614,freq=1.0), product of:
              0.13728222 = queryWeight, product of:
                1.2858638 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.018329076 = queryNorm
              0.3640481 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=1614)
          0.012550143 = weight(abstract_txt:which in 1614) [ClassicSimilarity], result of:
            0.012550143 = score(doc=1614,freq=1.0), product of:
              0.06884536 = queryWeight, product of:
                1.2877755 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.018329076 = queryNorm
              0.18229467 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.0625 = fieldNorm(doc=1614)
          0.05955767 = weight(abstract_txt:mining in 1614) [ClassicSimilarity], result of:
            0.05955767 = score(doc=1614,freq=1.0), product of:
              0.15430881 = queryWeight, product of:
                1.3632741 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.018329076 = queryNorm
              0.38596416 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0625 = fieldNorm(doc=1614)
          0.0400959 = weight(abstract_txt:knowledge in 1614) [ClassicSimilarity], result of:
            0.0400959 = score(doc=1614,freq=2.0), product of:
              0.12768373 = queryWeight, product of:
                1.9607652 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.018329076 = queryNorm
              0.31402513 = fieldWeight in 1614, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.0625 = fieldNorm(doc=1614)
          0.43862444 = weight(abstract_txt:fuzzy in 1614) [ClassicSimilarity], result of:
            0.43862444 = score(doc=1614,freq=13.0), product of:
              0.28436542 = queryWeight, product of:
                2.2665842 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.018329076 = queryNorm
              1.5424676 = fieldWeight in 1614, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.0625 = fieldNorm(doc=1614)
        0.32 = coord(8/25)
    
  4. Jiang, X.; Tan, A.-H.: CRCTOL: a semantic-based domain ontology learning system (2009) 0.19
    0.19158076 = sum of:
      0.19158076 = product of:
        0.53216875 = sum of:
          0.029087823 = weight(abstract_txt:techniques in 3320) [ClassicSimilarity], result of:
            0.029087823 = score(doc=3320,freq=2.0), product of:
              0.08302796 = queryWeight, product of:
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.018329076 = queryNorm
              0.35033768 = fieldWeight in 3320, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.029000387 = weight(abstract_txt:natural in 3320) [ClassicSimilarity], result of:
            0.029000387 = score(doc=3320,freq=1.0), product of:
              0.104398936 = queryWeight, product of:
                1.1213362 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.018329076 = queryNorm
              0.27778432 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.03864993 = weight(abstract_txt:technique in 3320) [ClassicSimilarity], result of:
            0.03864993 = score(doc=3320,freq=1.0), product of:
              0.12643269 = queryWeight, product of:
                1.2340066 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.018329076 = queryNorm
              0.3056957 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.0902623 = weight(abstract_txt:mining in 3320) [ClassicSimilarity], result of:
            0.0902623 = score(doc=3320,freq=3.0), product of:
              0.15430881 = queryWeight, product of:
                1.3632741 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.018329076 = queryNorm
              0.58494586 = fieldWeight in 3320, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.060001507 = weight(abstract_txt:texts in 3320) [ClassicSimilarity], result of:
            0.060001507 = score(doc=3320,freq=1.0), product of:
              0.19404334 = queryWeight, product of:
                1.8723319 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.018329076 = queryNorm
              0.30921704 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.024808073 = weight(abstract_txt:knowledge in 3320) [ClassicSimilarity], result of:
            0.024808073 = score(doc=3320,freq=1.0), product of:
              0.12768373 = queryWeight, product of:
                1.9607652 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.018329076 = queryNorm
              0.19429314 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.06336304 = weight(abstract_txt:text in 3320) [ClassicSimilarity], result of:
            0.06336304 = score(doc=3320,freq=3.0), product of:
              0.16542093 = queryWeight, product of:
                2.2317886 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.018329076 = queryNorm
              0.38304123 = fieldWeight in 3320, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.074311025 = weight(abstract_txt:automatically in 3320) [ClassicSimilarity], result of:
            0.074311025 = score(doc=3320,freq=1.0), product of:
              0.24630451 = queryWeight, product of:
                2.4357853 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.018329076 = queryNorm
              0.30170387 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.12268469 = weight(abstract_txt:concept in 3320) [ClassicSimilarity], result of:
            0.12268469 = score(doc=3320,freq=3.0), product of:
              0.28747675 = queryWeight, product of:
                3.4811537 = boost
                4.505458 = idf(docFreq=1327, maxDocs=44218)
                0.018329076 = queryNorm
              0.42676386 = fieldWeight in 3320, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.505458 = idf(docFreq=1327, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
        0.36 = coord(9/25)
    
  5. Dumais, S.T.: Latent semantic analysis (2003) 0.17
    0.17404641 = sum of:
      0.17404641 = product of:
        0.3625967 = sum of:
          0.016621612 = weight(abstract_txt:techniques in 2462) [ClassicSimilarity], result of:
            0.016621612 = score(doc=2462,freq=2.0), product of:
              0.08302796 = queryWeight, product of:
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.018329076 = queryNorm
              0.20019296 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.01337636 = weight(abstract_txt:their in 2462) [ClassicSimilarity], result of:
            0.01337636 = score(doc=2462,freq=5.0), product of:
              0.060587723 = queryWeight, product of:
                1.0462266 = boost
                3.1594994 = idf(docFreq=5101, maxDocs=44218)
                0.018329076 = queryNorm
              0.22077674 = fieldWeight in 2462, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.1594994 = idf(docFreq=5101, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.016571648 = weight(abstract_txt:natural in 2462) [ClassicSimilarity], result of:
            0.016571648 = score(doc=2462,freq=1.0), product of:
              0.104398936 = queryWeight, product of:
                1.1213362 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.018329076 = queryNorm
              0.15873389 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.011140768 = weight(abstract_txt:using in 2462) [ClassicSimilarity], result of:
            0.011140768 = score(doc=2462,freq=2.0), product of:
              0.07279185 = queryWeight, product of:
                1.1467661 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.018329076 = queryNorm
              0.15304966 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.03825351 = weight(abstract_txt:technique in 2462) [ClassicSimilarity], result of:
            0.03825351 = score(doc=2462,freq=3.0), product of:
              0.12643269 = queryWeight, product of:
                1.2340066 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.018329076 = queryNorm
              0.30256027 = fieldWeight in 2462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.024632838 = weight(abstract_txt:short in 2462) [ClassicSimilarity], result of:
            0.024632838 = score(doc=2462,freq=1.0), product of:
              0.13597588 = queryWeight, product of:
                1.2797313 = boost
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.018329076 = queryNorm
              0.18115593 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.029778834 = weight(abstract_txt:mining in 2462) [ClassicSimilarity], result of:
            0.029778834 = score(doc=2462,freq=1.0), product of:
              0.15430881 = queryWeight, product of:
                1.3632741 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.018329076 = queryNorm
              0.19298208 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.031919498 = weight(abstract_txt:generated in 2462) [ClassicSimilarity], result of:
            0.031919498 = score(doc=2462,freq=1.0), product of:
              0.1850064 = queryWeight, product of:
                1.8282131 = boost
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.018329076 = queryNorm
              0.17253187 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.06857315 = weight(abstract_txt:texts in 2462) [ClassicSimilarity], result of:
            0.06857315 = score(doc=2462,freq=4.0), product of:
              0.19404334 = queryWeight, product of:
                1.8723319 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.018329076 = queryNorm
              0.3533909 = fieldWeight in 2462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.02004795 = weight(abstract_txt:knowledge in 2462) [ClassicSimilarity], result of:
            0.02004795 = score(doc=2462,freq=2.0), product of:
              0.12768373 = queryWeight, product of:
                1.9607652 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.018329076 = queryNorm
              0.15701257 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.05120507 = weight(abstract_txt:text in 2462) [ClassicSimilarity], result of:
            0.05120507 = score(doc=2462,freq=6.0), product of:
              0.16542093 = queryWeight, product of:
                2.2317886 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.018329076 = queryNorm
              0.30954406 = fieldWeight in 2462, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.04047545 = weight(abstract_txt:concept in 2462) [ClassicSimilarity], result of:
            0.04047545 = score(doc=2462,freq=1.0), product of:
              0.28747675 = queryWeight, product of:
                3.4811537 = boost
                4.505458 = idf(docFreq=1327, maxDocs=44218)
                0.018329076 = queryNorm
              0.14079556 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.505458 = idf(docFreq=1327, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
        0.48 = coord(12/25)