Document (#15669)

Author
Goh, A.
Hui, S.C.
Title
TES: a text extraction system
Source
Microcomputers for information management. 13(1996) no.1, pp. 41-55
Year
1996
Abstract
With the onset of the information explosion arising from digital libraries and access to a wealth of information through the Internet, the need to determine the relevance of a document efficiently becomes even more urgent. Describes a text extraction system (TES), which retrieves a set of sentences from a document to form an indicative abstract. Such an automated process enables information to be filtered more quickly. Discusses the combination of various text extraction techniques. Compares results with manually produced abstracts.
Theme
Automatic abstracting
Object
TES

Similar documents (content)

  1. Goh, A.; Hui, S.C.; Chan, S.K.: A text extraction system for news reports (1996) 0.26
    0.2592035 = sum of:
      0.2592035 = product of:
        0.9257268 = sum of:
          0.09712163 = weight(abstract_txt:abstracts in 6670) [ClassicSimilarity], result of:
            0.09712163 = score(doc=6670,freq=4.0), product of:
              0.13041835 = queryWeight, product of:
                1.0575674 = boost
                5.9575443 = idf(docFreq=294, maxDocs=41962)
                0.020699669 = queryNorm
              0.74469304 = fieldWeight in 6670, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.9575443 = idf(docFreq=294, maxDocs=41962)
                0.0625 = fieldNorm(doc=6670)
          0.09719029 = weight(abstract_txt:manually in 6670) [ClassicSimilarity], result of:
            0.09719029 = score(doc=6670,freq=2.0), product of:
              0.16439424 = queryWeight, product of:
                1.1873589 = boost
                6.6886926 = idf(docFreq=141, maxDocs=41962)
                0.020699669 = queryNorm
              0.5912025 = fieldWeight in 6670, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6886926 = idf(docFreq=141, maxDocs=41962)
                0.0625 = fieldNorm(doc=6670)
          0.024707492 = weight(abstract_txt:system in 6670) [ClassicSimilarity], result of:
            0.024707492 = score(doc=6670,freq=2.0), product of:
              0.083119035 = queryWeight, product of:
                1.1939989 = boost
                3.3630488 = idf(docFreq=3949, maxDocs=41962)
                0.020699669 = queryNorm
              0.29725432 = fieldWeight in 6670, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3630488 = idf(docFreq=3949, maxDocs=41962)
                0.0625 = fieldNorm(doc=6670)
          0.11281161 = weight(abstract_txt:sentences in 6670) [ClassicSimilarity], result of:
            0.11281161 = score(doc=6670,freq=2.0), product of:
              0.18156853 = queryWeight, product of:
                1.2478403 = boost
                7.0293994 = idf(docFreq=100, maxDocs=41962)
                0.020699669 = queryNorm
              0.62131697 = fieldWeight in 6670, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0293994 = idf(docFreq=100, maxDocs=41962)
                0.0625 = fieldNorm(doc=6670)
          0.17569289 = weight(abstract_txt:indicative in 6670) [ClassicSimilarity], result of:
            0.17569289 = score(doc=6670,freq=2.0), product of:
              0.24395375 = queryWeight, product of:
                1.4464134 = boost
                8.148012 = idf(docFreq=32, maxDocs=41962)
                0.020699669 = queryNorm
              0.72018933 = fieldWeight in 6670, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.148012 = idf(docFreq=32, maxDocs=41962)
                0.0625 = fieldNorm(doc=6670)
          0.045962073 = weight(abstract_txt:text in 6670) [ClassicSimilarity], result of:
            0.045962073 = score(doc=6670,freq=1.0), product of:
              0.18132383 = queryWeight, product of:
                2.1598659 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.020699669 = queryNorm
              0.2534806 = fieldWeight in 6670, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0625 = fieldNorm(doc=6670)
          0.37224075 = weight(abstract_txt:extraction in 6670) [ClassicSimilarity], result of:
            0.37224075 = score(doc=6670,freq=5.0), product of:
              0.4276427 = queryWeight, product of:
                3.316957 = boost
                6.2284193 = idf(docFreq=224, maxDocs=41962)
                0.020699669 = queryNorm
              0.87044805 = fieldWeight in 6670, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.2284193 = idf(docFreq=224, maxDocs=41962)
                0.0625 = fieldNorm(doc=6670)
        0.28 = coord(7/25)
    
  2. Barrio, P.; Gravano, L.: Sampling strategies for information extraction over the deep web (2017) 0.14
    0.1371458 = sum of:
      0.1371458 = product of:
        0.685729 = sum of:
          0.04706033 = weight(abstract_txt:enables in 413) [ClassicSimilarity], result of:
            0.04706033 = score(doc=413,freq=1.0), product of:
              0.13960876 = queryWeight, product of:
                1.0941957 = boost
                6.163881 = idf(docFreq=239, maxDocs=41962)
                0.020699669 = queryNorm
              0.33708724 = fieldWeight in 413, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.163881 = idf(docFreq=239, maxDocs=41962)
                0.0546875 = fieldNorm(doc=413)
          0.024757747 = weight(abstract_txt:information in 413) [ClassicSimilarity], result of:
            0.024757747 = score(doc=413,freq=8.0), product of:
              0.06560856 = queryWeight, product of:
                1.2992107 = boost
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.020699669 = queryNorm
              0.37735543 = fieldWeight in 413, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.0546875 = fieldNorm(doc=413)
          0.070520595 = weight(abstract_txt:document in 413) [ClassicSimilarity], result of:
            0.070520595 = score(doc=413,freq=5.0), product of:
              0.13470177 = queryWeight, product of:
                1.5199887 = boost
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.020699669 = queryNorm
              0.5235313 = fieldWeight in 413, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.0546875 = fieldNorm(doc=413)
          0.10640369 = weight(abstract_txt:text in 413) [ClassicSimilarity], result of:
            0.10640369 = score(doc=413,freq=7.0), product of:
              0.18132383 = queryWeight, product of:
                2.1598659 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.020699669 = queryNorm
              0.58681583 = fieldWeight in 413, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0546875 = fieldNorm(doc=413)
          0.43698668 = weight(abstract_txt:extraction in 413) [ClassicSimilarity], result of:
            0.43698668 = score(doc=413,freq=9.0), product of:
              0.4276427 = queryWeight, product of:
                3.316957 = boost
                6.2284193 = idf(docFreq=224, maxDocs=41962)
                0.020699669 = queryNorm
              1.02185 = fieldWeight in 413, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.2284193 = idf(docFreq=224, maxDocs=41962)
                0.0546875 = fieldNorm(doc=413)
        0.2 = coord(5/25)
    
  3. Reeve, L.H.; Han, H.; Brooks, A.D.: The use of domain-specific concepts in biomedical text summarization (2007) 0.12
    0.12207365 = sum of:
      0.12207365 = product of:
        0.4359773 = sum of:
          0.053783234 = weight(abstract_txt:enables in 2956) [ClassicSimilarity], result of:
            0.053783234 = score(doc=2956,freq=1.0), product of:
              0.13960876 = queryWeight, product of:
                1.0941957 = boost
                6.163881 = idf(docFreq=239, maxDocs=41962)
                0.020699669 = queryNorm
              0.38524255 = fieldWeight in 2956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.163881 = idf(docFreq=239, maxDocs=41962)
                0.0625 = fieldNorm(doc=2956)
          0.055849917 = weight(abstract_txt:abstract in 2956) [ClassicSimilarity], result of:
            0.055849917 = score(doc=2956,freq=1.0), product of:
              0.14316265 = queryWeight, product of:
                1.1080352 = boost
                6.2418423 = idf(docFreq=221, maxDocs=41962)
                0.020699669 = queryNorm
              0.39011514 = fieldWeight in 2956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2418423 = idf(docFreq=221, maxDocs=41962)
                0.0625 = fieldNorm(doc=2956)
          0.06872391 = weight(abstract_txt:manually in 2956) [ClassicSimilarity], result of:
            0.06872391 = score(doc=2956,freq=1.0), product of:
              0.16439424 = queryWeight, product of:
                1.1873589 = boost
                6.6886926 = idf(docFreq=141, maxDocs=41962)
                0.020699669 = queryNorm
              0.4180433 = fieldWeight in 2956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6886926 = idf(docFreq=141, maxDocs=41962)
                0.0625 = fieldNorm(doc=2956)
          0.024707492 = weight(abstract_txt:system in 2956) [ClassicSimilarity], result of:
            0.024707492 = score(doc=2956,freq=2.0), product of:
              0.083119035 = queryWeight, product of:
                1.1939989 = boost
                3.3630488 = idf(docFreq=3949, maxDocs=41962)
                0.020699669 = queryNorm
              0.29725432 = fieldWeight in 2956, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3630488 = idf(docFreq=3949, maxDocs=41962)
                0.0625 = fieldNorm(doc=2956)
          0.11281161 = weight(abstract_txt:sentences in 2956) [ClassicSimilarity], result of:
            0.11281161 = score(doc=2956,freq=2.0), product of:
              0.18156853 = queryWeight, product of:
                1.2478403 = boost
                7.0293994 = idf(docFreq=100, maxDocs=41962)
                0.020699669 = queryNorm
              0.62131697 = fieldWeight in 2956, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0293994 = idf(docFreq=100, maxDocs=41962)
                0.0625 = fieldNorm(doc=2956)
          0.017326813 = weight(abstract_txt:information in 2956) [ClassicSimilarity], result of:
            0.017326813 = score(doc=2956,freq=3.0), product of:
              0.06560856 = queryWeight, product of:
                1.2992107 = boost
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.020699669 = queryNorm
              0.2640938 = fieldWeight in 2956, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.0625 = fieldNorm(doc=2956)
          0.10277432 = weight(abstract_txt:text in 2956) [ClassicSimilarity], result of:
            0.10277432 = score(doc=2956,freq=5.0), product of:
              0.18132383 = queryWeight, product of:
                2.1598659 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.020699669 = queryNorm
              0.5667999 = fieldWeight in 2956, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0625 = fieldNorm(doc=2956)
        0.28 = coord(7/25)
    
  4. Wang, P.; Hao, T.; Yan, J.; Jin, L.: Large-scale extraction of drug-disease pairs from the medical literature (2017) 0.12
    0.12048721 = sum of:
      0.12048721 = product of:
        0.5020301 = sum of:
          0.048560817 = weight(abstract_txt:abstracts in 492) [ClassicSimilarity], result of:
            0.048560817 = score(doc=492,freq=1.0), product of:
              0.13041835 = queryWeight, product of:
                1.0575674 = boost
                5.9575443 = idf(docFreq=294, maxDocs=41962)
                0.020699669 = queryNorm
              0.37234652 = fieldWeight in 492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9575443 = idf(docFreq=294, maxDocs=41962)
                0.0625 = fieldNorm(doc=492)
          0.06872391 = weight(abstract_txt:manually in 492) [ClassicSimilarity], result of:
            0.06872391 = score(doc=492,freq=1.0), product of:
              0.16439424 = queryWeight, product of:
                1.1873589 = boost
                6.6886926 = idf(docFreq=141, maxDocs=41962)
                0.020699669 = queryNorm
              0.4180433 = fieldWeight in 492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6886926 = idf(docFreq=141, maxDocs=41962)
                0.0625 = fieldNorm(doc=492)
          0.07431577 = weight(abstract_txt:efficiently in 492) [ClassicSimilarity], result of:
            0.07431577 = score(doc=492,freq=1.0), product of:
              0.173195 = queryWeight, product of:
                1.2187269 = boost
                6.865396 = idf(docFreq=118, maxDocs=41962)
                0.020699669 = queryNorm
              0.42908725 = fieldWeight in 492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.865396 = idf(docFreq=118, maxDocs=41962)
                0.0625 = fieldNorm(doc=492)
          0.010003641 = weight(abstract_txt:information in 492) [ClassicSimilarity], result of:
            0.010003641 = score(doc=492,freq=1.0), product of:
              0.06560856 = queryWeight, product of:
                1.2992107 = boost
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.020699669 = queryNorm
              0.15247463 = fieldWeight in 492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.0625 = fieldNorm(doc=492)
          0.06500019 = weight(abstract_txt:text in 492) [ClassicSimilarity], result of:
            0.06500019 = score(doc=492,freq=2.0), product of:
              0.18132383 = queryWeight, product of:
                2.1598659 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.020699669 = queryNorm
              0.35847571 = fieldWeight in 492, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0625 = fieldNorm(doc=492)
          0.23542574 = weight(abstract_txt:extraction in 492) [ClassicSimilarity], result of:
            0.23542574 = score(doc=492,freq=2.0), product of:
              0.4276427 = queryWeight, product of:
                3.316957 = boost
                6.2284193 = idf(docFreq=224, maxDocs=41962)
                0.020699669 = queryNorm
              0.5505197 = fieldWeight in 492, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2284193 = idf(docFreq=224, maxDocs=41962)
                0.0625 = fieldNorm(doc=492)
        0.24 = coord(6/25)
    
  5. Zhou, G.D.; Zhang, M.: Extracting relation information from text documents by exploring various types of knowledge (2007) 0.12
    0.11787956 = sum of:
      0.11787956 = product of:
        0.5893978 = sum of:
          0.053783234 = weight(abstract_txt:enables in 2928) [ClassicSimilarity], result of:
            0.053783234 = score(doc=2928,freq=1.0), product of:
              0.13960876 = queryWeight, product of:
                1.0941957 = boost
                6.163881 = idf(docFreq=239, maxDocs=41962)
                0.020699669 = queryNorm
              0.38524255 = fieldWeight in 2928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.163881 = idf(docFreq=239, maxDocs=41962)
                0.0625 = fieldNorm(doc=2928)
          0.024707492 = weight(abstract_txt:system in 2928) [ClassicSimilarity], result of:
            0.024707492 = score(doc=2928,freq=2.0), product of:
              0.083119035 = queryWeight, product of:
                1.1939989 = boost
                3.3630488 = idf(docFreq=3949, maxDocs=41962)
                0.020699669 = queryNorm
              0.29725432 = fieldWeight in 2928, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3630488 = idf(docFreq=3949, maxDocs=41962)
                0.0625 = fieldNorm(doc=2928)
          0.024503818 = weight(abstract_txt:information in 2928) [ClassicSimilarity], result of:
            0.024503818 = score(doc=2928,freq=6.0), product of:
              0.06560856 = queryWeight, product of:
                1.2992107 = boost
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.020699669 = queryNorm
              0.37348506 = fieldWeight in 2928, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.0625 = fieldNorm(doc=2928)
          0.045962073 = weight(abstract_txt:text in 2928) [ClassicSimilarity], result of:
            0.045962073 = score(doc=2928,freq=1.0), product of:
              0.18132383 = queryWeight, product of:
                2.1598659 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.020699669 = queryNorm
              0.2534806 = fieldWeight in 2928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0625 = fieldNorm(doc=2928)
          0.4404412 = weight(abstract_txt:extraction in 2928) [ClassicSimilarity], result of:
            0.4404412 = score(doc=2928,freq=7.0), product of:
              0.4276427 = queryWeight, product of:
                3.316957 = boost
                6.2284193 = idf(docFreq=224, maxDocs=41962)
                0.020699669 = queryNorm
              1.029928 = fieldWeight in 2928, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2284193 = idf(docFreq=224, maxDocs=41962)
                0.0625 = fieldNorm(doc=2928)
        0.2 = coord(5/25)
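
The score trees above can be checked by hand. The following is a minimal sketch, not the catalog's actual code, that recomputes the score of result 1 (doc 6670) from the values printed in its tree. It assumes the usual ClassicSimilarity definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), both of which agree with the figures shown. The function and variable names (`classic_idf`, `term_score`, `document_score`, `TERMS_DOC_6670`) are hypothetical and chosen only for illustration.

```python
import math

# Collection-level constants copied from the explain output above.
MAX_DOCS = 41962
QUERY_NORM = 0.020699669


def classic_idf(doc_freq, max_docs=MAX_DOCS):
    """Assumed idf formula: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm, query_norm=QUERY_NORM):
    """Score of one query term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm  # tf = sqrt(freq)
    return query_weight * field_weight


# (term, freq, docFreq, boost) as listed for doc 6670; fieldNorm is 0.0625.
TERMS_DOC_6670 = [
    ("abstracts", 4.0, 294, 1.0575674),
    ("manually", 2.0, 141, 1.1873589),
    ("system", 2.0, 3949, 1.1939989),
    ("sentences", 2.0, 100, 1.2478403),
    ("indicative", 2.0, 32, 1.4464134),
    ("text", 1.0, 1975, 2.1598659),
    ("extraction", 5.0, 224, 3.316957),
]


def document_score(terms, field_norm, matched, total_query_terms):
    """Sum of the term scores, scaled by coord = matched / total query terms."""
    raw = sum(term_score(f, df, b, field_norm) for _, f, df, b in terms)
    return raw * (matched / total_query_terms)


score_6670 = document_score(TERMS_DOC_6670, 0.0625, matched=7, total_query_terms=25)
print(f"doc 6670: {score_6670:.7f}")
```

The printed value should agree with the 0.2592035 reported for result 1 up to single-precision rounding, since the explain output shows 32-bit floats while Python computes in double precision. The other results can be checked the same way by substituting their term lists, fieldNorm, and coord values.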