Document (#6684)

Author
Rau, L.F.
Jacobs, P.S.
Zernik, U.
Title
Information extraction and text summarization using linguistic knowledge acquisition
Source
Information processing and management. 25(1989) no.4, S.419-428
Year
1989
Abstract
Storing and accessing texts in a conceptual format has a number of advantages over traditional document retrieval methods. A conceptual format facilitates natural language access to text information. It can support imprecise and inexact queries, conceptual information summarisation, and, ultimately, document translation. Describes 2 methods which have been implemented in a prototype intelligent information retrieval system calles SCISOR (System for Conceptual Information Summarisation, Organization and Retrieval). Describes the text processing, language acquisition, and summarisation components of SCISOR
Theme
Computerlinguistik
Object
SCISOR

Similar documents (author)

  1. Jacobs, M.: Criteria for evaluating alternative MEDLINE search engines (1998) 5.39
    5.393951 = sum of:
      5.393951 = weight(author_txt:jacobs in 5265) [ClassicSimilarity], result of:
        5.393951 = score(doc=5265,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.11587052 = queryNorm
          5.3939514 = fieldWeight in 5265, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.625 = fieldNorm(doc=5265)
    
  2. Jacobs, E.H.: Buying into classes : the practice of book selection in eighteenth-Century Britain (1999) 5.39
    5.393951 = sum of:
      5.393951 = weight(author_txt:jacobs in 1155) [ClassicSimilarity], result of:
        5.393951 = score(doc=1155,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.11587052 = queryNorm
          5.3939514 = fieldWeight in 1155, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.625 = fieldNorm(doc=1155)
    
  3. Jacobs, C.: If a picture is worth a thousand words, then ... (1999) 5.39
    5.393951 = sum of:
      5.393951 = weight(author_txt:jacobs in 1322) [ClassicSimilarity], result of:
        5.393951 = score(doc=1322,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.11587052 = queryNorm
          5.3939514 = fieldWeight in 1322, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.625 = fieldNorm(doc=1322)
    
  4. Jacobs, N.: Information technology and interests in scholarly communication : a discourse analysis (2001) 5.39
    5.393951 = sum of:
      5.393951 = weight(author_txt:jacobs in 1849) [ClassicSimilarity], result of:
        5.393951 = score(doc=1849,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.11587052 = queryNorm
          5.3939514 = fieldWeight in 1849, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.625 = fieldNorm(doc=1849)
    
  5. Jacobs, I.: From chaos, order: W3C standard helps organize knowledge : SKOS Connects Diverse Knowledge Organization Systems to Linked Data (2009) 5.39
    5.393951 = sum of:
      5.393951 = weight(author_txt:jacobs in 63) [ClassicSimilarity], result of:
        5.393951 = score(doc=63,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.11587052 = queryNorm
          5.3939514 = fieldWeight in 63, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.630322 = idf(docFreq=20, maxDocs=43254)
            0.625 = fieldNorm(doc=63)
    

Similar documents (content)

  1. Salton, G.: Automatic text structuring and summarization (1997) 0.28
    0.27641135 = sum of:
      0.27641135 = product of:
        1.3820567 = sum of:
          0.06256496 = weight(abstract_txt:extraction in 2146) [ClassicSimilarity], result of:
            0.06256496 = score(doc=2146,freq=1.0), product of:
              0.10744779 = queryWeight, product of:
                1.0724319 = boost
                6.2110133 = idf(docFreq=235, maxDocs=43254)
                0.016131148 = queryNorm
              0.5822825 = fieldWeight in 2146, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2110133 = idf(docFreq=235, maxDocs=43254)
                0.09375 = fieldNorm(doc=2146)
          0.037746634 = weight(abstract_txt:methods in 2146) [ClassicSimilarity], result of:
            0.037746634 = score(doc=2146,freq=1.0), product of:
              0.09665822 = queryWeight, product of:
                1.4384851 = boost
                4.1655097 = idf(docFreq=1824, maxDocs=43254)
                0.016131148 = queryNorm
              0.39051652 = fieldWeight in 2146, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1655097 = idf(docFreq=1824, maxDocs=43254)
                0.09375 = fieldNorm(doc=2146)
          0.082123294 = weight(abstract_txt:document in 2146) [ClassicSimilarity], result of:
            0.082123294 = score(doc=2146,freq=4.0), product of:
              0.10223766 = queryWeight, product of:
                1.4794198 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.016131148 = queryNorm
              0.8032588 = fieldWeight in 2146, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.09375 = fieldNorm(doc=2146)
          0.11634047 = weight(abstract_txt:text in 2146) [ClassicSimilarity], result of:
            0.11634047 = score(doc=2146,freq=5.0), product of:
              0.13704008 = queryWeight, product of:
                2.0977583 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.016131148 = queryNorm
              0.8489522 = fieldWeight in 2146, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.09375 = fieldNorm(doc=2146)
          1.0832814 = weight(abstract_txt:summarisation in 2146) [ClassicSimilarity], result of:
            1.0832814 = score(doc=2146,freq=3.0), product of:
              0.7191247 = queryWeight, product of:
                4.805446 = boost
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.016131148 = queryNorm
              1.5063889 = fieldWeight in 2146, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.09375 = fieldNorm(doc=2146)
        0.2 = coord(5/25)
    
  2. Szlávik, Z.; Tombros, A.; Lalmas, M.: Summarisation of the logical structure of XML documents (2012) 0.22
    0.22371006 = sum of:
      0.22371006 = product of:
        1.1185503 = sum of:
          0.035587866 = weight(abstract_txt:methods in 4196) [ClassicSimilarity], result of:
            0.035587866 = score(doc=4196,freq=2.0), product of:
              0.09665822 = queryWeight, product of:
                1.4384851 = boost
                4.1655097 = idf(docFreq=1824, maxDocs=43254)
                0.016131148 = queryNorm
              0.3681825 = fieldWeight in 4196, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1655097 = idf(docFreq=1824, maxDocs=43254)
                0.0625 = fieldNorm(doc=4196)
          0.027374431 = weight(abstract_txt:document in 4196) [ClassicSimilarity], result of:
            0.027374431 = score(doc=4196,freq=1.0), product of:
              0.10223766 = queryWeight, product of:
                1.4794198 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.016131148 = queryNorm
              0.26775292 = fieldWeight in 4196, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.0625 = fieldNorm(doc=4196)
          0.021818485 = weight(abstract_txt:retrieval in 4196) [ClassicSimilarity], result of:
            0.021818485 = score(doc=4196,freq=1.0), product of:
              0.100606866 = queryWeight, product of:
                1.7974029 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.016131148 = queryNorm
              0.21686874 = fieldWeight in 4196, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0625 = fieldNorm(doc=4196)
          0.012441962 = weight(abstract_txt:information in 4196) [ClassicSimilarity], result of:
            0.012441962 = score(doc=4196,freq=1.0), product of:
              0.082026355 = queryWeight, product of:
                2.0952349 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.016131148 = queryNorm
              0.1516825 = fieldWeight in 4196, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.0625 = fieldNorm(doc=4196)
          1.0213276 = weight(abstract_txt:summarisation in 4196) [ClassicSimilarity], result of:
            1.0213276 = score(doc=4196,freq=6.0), product of:
              0.7191247 = queryWeight, product of:
                4.805446 = boost
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.016131148 = queryNorm
              1.4202372 = fieldWeight in 4196, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.0625 = fieldNorm(doc=4196)
        0.2 = coord(5/25)
    
  3. Huo, W.: Automatic multi-word term extraction and its application to Web-page summarization (2012) 0.20
    0.19739777 = sum of:
      0.19739777 = product of:
        0.49349442 = sum of:
          0.051200625 = weight(abstract_txt:translation in 2028) [ClassicSimilarity], result of:
            0.051200625 = score(doc=2028,freq=1.0), product of:
              0.10615676 = queryWeight, product of:
                1.0659696 = boost
                6.173587 = idf(docFreq=244, maxDocs=43254)
                0.016131148 = queryNorm
              0.4823115 = fieldWeight in 2028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.173587 = idf(docFreq=244, maxDocs=43254)
                0.078125 = fieldNorm(doc=2028)
          0.09030475 = weight(abstract_txt:extraction in 2028) [ClassicSimilarity], result of:
            0.09030475 = score(doc=2028,freq=3.0), product of:
              0.10744779 = queryWeight, product of:
                1.0724319 = boost
                6.2110133 = idf(docFreq=235, maxDocs=43254)
                0.016131148 = queryNorm
              0.8404524 = fieldWeight in 2028, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2110133 = idf(docFreq=235, maxDocs=43254)
                0.078125 = fieldNorm(doc=2028)
          0.016577624 = weight(abstract_txt:system in 2028) [ClassicSimilarity], result of:
            0.016577624 = score(doc=2028,freq=1.0), product of:
              0.06306509 = queryWeight, product of:
                1.1619314 = boost
                3.364676 = idf(docFreq=4064, maxDocs=43254)
                0.016131148 = queryNorm
              0.2628653 = fieldWeight in 2028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.364676 = idf(docFreq=4064, maxDocs=43254)
                0.078125 = fieldNorm(doc=2028)
          0.13731799 = weight(abstract_txt:summarization in 2028) [ClassicSimilarity], result of:
            0.13731799 = score(doc=2028,freq=3.0), product of:
              0.14208297 = queryWeight, product of:
                1.2332242 = boost
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.016131148 = queryNorm
              0.9664634 = fieldWeight in 2028, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.078125 = fieldNorm(doc=2028)
          0.031455528 = weight(abstract_txt:methods in 2028) [ClassicSimilarity], result of:
            0.031455528 = score(doc=2028,freq=1.0), product of:
              0.09665822 = queryWeight, product of:
                1.4384851 = boost
                4.1655097 = idf(docFreq=1824, maxDocs=43254)
                0.016131148 = queryNorm
              0.32543045 = fieldWeight in 2028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1655097 = idf(docFreq=1824, maxDocs=43254)
                0.078125 = fieldNorm(doc=2028)
          0.032063212 = weight(abstract_txt:language in 2028) [ClassicSimilarity], result of:
            0.032063212 = score(doc=2028,freq=1.0), product of:
              0.09789913 = queryWeight, product of:
                1.4476894 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.016131148 = queryNorm
              0.32751274 = fieldWeight in 2028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.078125 = fieldNorm(doc=2028)
          0.04839161 = weight(abstract_txt:document in 2028) [ClassicSimilarity], result of:
            0.04839161 = score(doc=2028,freq=2.0), product of:
              0.10223766 = queryWeight, product of:
                1.4794198 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.016131148 = queryNorm
              0.47332472 = fieldWeight in 2028, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.078125 = fieldNorm(doc=2028)
          0.027273105 = weight(abstract_txt:retrieval in 2028) [ClassicSimilarity], result of:
            0.027273105 = score(doc=2028,freq=1.0), product of:
              0.100606866 = queryWeight, product of:
                1.7974029 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.016131148 = queryNorm
              0.27108592 = fieldWeight in 2028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.078125 = fieldNorm(doc=2028)
          0.015552453 = weight(abstract_txt:information in 2028) [ClassicSimilarity], result of:
            0.015552453 = score(doc=2028,freq=1.0), product of:
              0.082026355 = queryWeight, product of:
                2.0952349 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.016131148 = queryNorm
              0.18960312 = fieldWeight in 2028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.078125 = fieldNorm(doc=2028)
          0.043357532 = weight(abstract_txt:text in 2028) [ClassicSimilarity], result of:
            0.043357532 = score(doc=2028,freq=1.0), product of:
              0.13704008 = queryWeight, product of:
                2.0977583 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.016131148 = queryNorm
              0.31638578 = fieldWeight in 2028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.078125 = fieldNorm(doc=2028)
        0.4 = coord(10/25)
    
  4. Sweeney, S.; Crestani, F.; Losada, D.E.: 'Show me more' : incremental length summarisation using novelty detection (2008) 0.18
    0.17985336 = sum of:
      0.17985336 = product of:
        0.89926684 = sum of:
          0.04545584 = weight(abstract_txt:accessing in 4055) [ClassicSimilarity], result of:
            0.04545584 = score(doc=4055,freq=1.0), product of:
              0.11378822 = queryWeight, product of:
                1.1036202 = boost
                6.391641 = idf(docFreq=196, maxDocs=43254)
                0.016131148 = queryNorm
              0.39947757 = fieldWeight in 4055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.391641 = idf(docFreq=196, maxDocs=43254)
                0.0625 = fieldNorm(doc=4055)
          0.054748863 = weight(abstract_txt:document in 4055) [ClassicSimilarity], result of:
            0.054748863 = score(doc=4055,freq=4.0), product of:
              0.10223766 = queryWeight, product of:
                1.4794198 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.016131148 = queryNorm
              0.53550583 = fieldWeight in 4055, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.0625 = fieldNorm(doc=4055)
          0.027821073 = weight(abstract_txt:information in 4055) [ClassicSimilarity], result of:
            0.027821073 = score(doc=4055,freq=5.0), product of:
              0.082026355 = queryWeight, product of:
                2.0952349 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.016131148 = queryNorm
              0.33917236 = fieldWeight in 4055, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.0625 = fieldNorm(doc=4055)
          0.04905345 = weight(abstract_txt:text in 4055) [ClassicSimilarity], result of:
            0.04905345 = score(doc=4055,freq=2.0), product of:
              0.13704008 = queryWeight, product of:
                2.0977583 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.016131148 = queryNorm
              0.35794964 = fieldWeight in 4055, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0625 = fieldNorm(doc=4055)
          0.7221876 = weight(abstract_txt:summarisation in 4055) [ClassicSimilarity], result of:
            0.7221876 = score(doc=4055,freq=3.0), product of:
              0.7191247 = queryWeight, product of:
                4.805446 = boost
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.016131148 = queryNorm
              1.0042592 = fieldWeight in 4055, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.0625 = fieldNorm(doc=4055)
        0.2 = coord(5/25)
    
  5. Lihui, C.; Lian, C.W.: Using Web structure and summarisation techniques for Web content mining (2005) 0.18
    0.17909224 = sum of:
      0.17909224 = product of:
        0.7462177 = sum of:
          0.035928465 = weight(abstract_txt:prototype in 3047) [ClassicSimilarity], result of:
            0.035928465 = score(doc=3047,freq=1.0), product of:
              0.09727396 = queryWeight, product of:
                1.0203973 = boost
                5.9096537 = idf(docFreq=318, maxDocs=43254)
                0.016131148 = queryNorm
              0.36935335 = fieldWeight in 3047, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9096537 = idf(docFreq=318, maxDocs=43254)
                0.0625 = fieldNorm(doc=3047)
          0.04249822 = weight(abstract_txt:intelligent in 3047) [ClassicSimilarity], result of:
            0.04249822 = score(doc=3047,freq=1.0), product of:
              0.10879727 = queryWeight, product of:
                1.0791454 = boost
                6.249895 = idf(docFreq=226, maxDocs=43254)
                0.016131148 = queryNorm
              0.39061844 = fieldWeight in 3047, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.249895 = idf(docFreq=226, maxDocs=43254)
                0.0625 = fieldNorm(doc=3047)
          0.038713288 = weight(abstract_txt:document in 3047) [ClassicSimilarity], result of:
            0.038713288 = score(doc=3047,freq=2.0), product of:
              0.10223766 = queryWeight, product of:
                1.4794198 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.016131148 = queryNorm
              0.37865978 = fieldWeight in 3047, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.0625 = fieldNorm(doc=3047)
          0.021818485 = weight(abstract_txt:retrieval in 3047) [ClassicSimilarity], result of:
            0.021818485 = score(doc=3047,freq=1.0), product of:
              0.100606866 = queryWeight, product of:
                1.7974029 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.016131148 = queryNorm
              0.21686874 = fieldWeight in 3047, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0625 = fieldNorm(doc=3047)
          0.017595591 = weight(abstract_txt:information in 3047) [ClassicSimilarity], result of:
            0.017595591 = score(doc=3047,freq=2.0), product of:
              0.082026355 = queryWeight, product of:
                2.0952349 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.016131148 = queryNorm
              0.21451144 = fieldWeight in 3047, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.0625 = fieldNorm(doc=3047)
          0.5896637 = weight(abstract_txt:summarisation in 3047) [ClassicSimilarity], result of:
            0.5896637 = score(doc=3047,freq=2.0), product of:
              0.7191247 = queryWeight, product of:
                4.805446 = boost
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.016131148 = queryNorm
              0.81997424 = fieldWeight in 3047, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.0625 = fieldNorm(doc=3047)
        0.24 = coord(6/25)