Search (15 results, page 1 of 1)

Salton, G.; Buckley, C.: Approaches to global text analysis (1990) 0.02

0.017083302 = product of:
  0.07687486 = sum of:
    0.05872617 = weight(_text_:applications in 4901) [ClassicSimilarity], result of:
      0.05872617 = score(doc=4901,freq=2.0), product of:
        0.17247584 = queryWeight, product of:
          4.4025097 = idf(docFreq=1471, maxDocs=44218)
          0.03917671 = queryNorm
        0.34048924 = fieldWeight in 4901, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.4025097 = idf(docFreq=1471, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4901)
    0.018148692 = weight(_text_:of in 4901) [ClassicSimilarity], result of:
      0.018148692 = score(doc=4901,freq=12.0), product of:
        0.061262865 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03917671 = queryNorm
        0.29624295 = fieldWeight in 4901, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4901)
  0.22222222 = coord(2/9)

Abstract: Current approaches to the analysis of natural language text are not viable for documents of unrestricted scope. A global text analysis system is proposed designed to identify homogeneous text environments in which the meaning of text words and phrases remains unambiguous, and useful term relationships may be automatically determined. The proposed methods include document clustering methods, as well as comparisons of local document excerpts in specified global contexts, leading to structured text representations in which similar texts, or text excerpts, are appropriately linked
Source: ASIS'90: Information in the year 2000, from research to applications. Proc. of the 53rd Annual Meeting of the American Society for Information Science, Toronto, Canada, 4.-8.11.1990. Ed. by Diana Henderson

Wong, S.K.M.; Yao, Y.Y.; Salton, G.; Buckley, C.: Evaluation of an adaptive linear model (1991) 0.00

0.002489248 = product of:
  0.022403233 = sum of:
    0.022403233 = weight(_text_:of in 4836) [ClassicSimilarity], result of:
      0.022403233 = score(doc=4836,freq=14.0), product of:
        0.061262865 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03917671 = queryNorm
        0.36569026 = fieldWeight in 4836, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=4836)
  0.11111111 = coord(1/9)

Abstract: Reports on the experimental evaluation of an adaptive linear model that constructs improved user query vectors from user preference judgements on a sample set of documents. The performance of this method is compared with that of the standard relevance feedback techniques. The experimental results seem to demonstrate the effectiveness of the adaptive method
Source: Journal of the American Society for Information Science. 42(1991) no.10, S.723-730

Salton, G.; Araya, J.: On the use of clustered file organizations in information search and retrieval (1990) 0.00

0.0024443932 = product of:
  0.021999538 = sum of:
    0.021999538 = weight(_text_:of in 2409) [ClassicSimilarity], result of:
      0.021999538 = score(doc=2409,freq=6.0), product of:
        0.061262865 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03917671 = queryNorm
        0.3591007 = fieldWeight in 2409, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.09375 = fieldNorm(doc=2409)
  0.11111111 = coord(1/9)

Imprint: Edmonton, Alberta : Univ. of Alberta, Faculty of Extension

Buckley, C.; Allan, J.; Salton, G.: Automatic routing and retrieval using Smart : TREC-2 (1995) 0.00
```
0.0022314154 = product of:
  0.020082738 = sum of:
    0.020082738 = weight(_text_:of in 5699) [ClassicSimilarity], result of:
      0.020082738 = score(doc=5699,freq=20.0), product of:
        0.061262865 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03917671 = queryNorm
        0.32781258 = fieldWeight in 5699, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=5699)
  0.11111111 = coord(1/9)
```
Abstract

The Smart information retrieval project emphazises completely automatic approaches to the understanding and retrieval of large quantities of text. The work in the TREC-2 environment continues, performing both routing and ad hoc experiments. The ad hoc work extends investigations into combining global similarities, giving an overall indication of how a document matches a query, with local similarities identifying a smaller part of the document that matches the query. The performance of ad hoc runs is good, but it is clear that full advantage of the available local information is not been taken advantage of. The routing experiments use conventional relevance feedback approaches to routing, but with a much greater degree of query expansion than was previously done. The length of a query vector is increased by a factor of 5 to 10 by adding terms found in previously seen relevant documents. This approach improves effectiveness by 30-40% over the original query
Salton, G.: Automatic text structuring and summarization (1997) 0.00
```
0.0021780923 = product of:
  0.01960283 = sum of:
    0.01960283 = weight(_text_:of in 145) [ClassicSimilarity], result of:
      0.01960283 = score(doc=145,freq=14.0), product of:
        0.061262865 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03917671 = queryNorm
        0.31997898 = fieldWeight in 145, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=145)
  0.11111111 = coord(1/9)
```
Abstract

Applies the ideas from the automatic link generation research to automatic text summarisation. Using techniques for inter-document link generation, generates intra-document links between passages of a document. Based on the intra-document linkage pattern of a text, characterises the structure of the text. Applies the knowledge of text structure to do automatic text summarisation by passage extraction. Evaluates a set of 50 summaries generated using these techniques by comparing the to paragraph extracts constructed by humans. The automatic summarisation methods perform well, especially in view of the fact that the summaries generates by 2 humans for the same article are surprisingly dissimilar

Footnote

Contribution to a special issue on methods and tools for the automatic construction of hypertext

Salton, G.: ¬The state of retrieval system evaluation (1992) 0.00

0.0021037988 = product of:
  0.018934188 = sum of:
    0.018934188 = weight(_text_:of in 5250) [ClassicSimilarity], result of:
      0.018934188 = score(doc=5250,freq=10.0), product of:
        0.061262865 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03917671 = queryNorm
        0.3090647 = fieldWeight in 5250, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=5250)
  0.11111111 = coord(1/9)

Abstract: Substatioal misgivings have been voiced over the years about the methodologies used to evaluate IR procedures and about the credibility of many of the available test results. In this note, an attempt is made to review the state of retrieval evaluation and to separate certain misgivings about the design of retrieval tests from conclusions that can legitimately be drawn from the evaluation results

Salton, G.; Allan, J.: Selective text utilization and text traversal (1995) 0.00

0.0018816947 = product of:
  0.016935252 = sum of:
    0.016935252 = weight(_text_:of in 6805) [ClassicSimilarity], result of:
      0.016935252 = score(doc=6805,freq=2.0), product of:
        0.061262865 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03917671 = queryNorm
        0.27643585 = fieldWeight in 6805, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.125 = fieldNorm(doc=6805)
  0.11111111 = coord(1/9)

Source: International journal of human-computer studies. 43(1995) no.3, S.xxx-xxx

Salton, G.; Buckley, C.; Smith, M.: On the application of syntactic methodologies in automatic text analysis (1990) 0.00

0.0016464829 = product of:
  0.014818345 = sum of:
    0.014818345 = weight(_text_:of in 7864) [ClassicSimilarity], result of:
      0.014818345 = score(doc=7864,freq=2.0), product of:
        0.061262865 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03917671 = queryNorm
        0.24188137 = fieldWeight in 7864, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.109375 = fieldNorm(doc=7864)
  0.11111111 = coord(1/9)

Salton, G.; Allen, J.; Buckley, C.; Singhal, A.: Automatic analysis, theme generation, and summarization of machine-readable data (1994) 0.00

0.0016464829 = product of:
  0.014818345 = sum of:
    0.014818345 = weight(_text_:of in 1168) [ClassicSimilarity], result of:
      0.014818345 = score(doc=1168,freq=2.0), product of:
        0.061262865 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03917671 = queryNorm
        0.24188137 = fieldWeight in 1168, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.109375 = fieldNorm(doc=1168)
  0.11111111 = coord(1/9)

Salton, G.; Buckley, C.; Allan, J.: Automatic structuring of text files (1992) 0.00
```
0.0016295954 = product of:
  0.014666359 = sum of:
    0.014666359 = weight(_text_:of in 6507) [ClassicSimilarity], result of:
      0.014666359 = score(doc=6507,freq=6.0), product of:
        0.061262865 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03917671 = queryNorm
        0.23940048 = fieldWeight in 6507, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=6507)
  0.11111111 = coord(1/9)
```
Abstract

In many practical information retrieval situations, it is necessary to process heterogeneous text databases that vary greatly in scope and coverage and deal with many different subjects. In such an environment it is important to provide flexible access to individual text pieces and to structure the collection so that related text elements are identified and properly linked. Describes methods for the automatic structuring of heterogeneous text collections and the construction of browsing tools and access procedures that facilitate collection use. Illustrates these emthods with searches using a large automated encyclopedia

Buckley, C.; Singhal, A.; Mitra, M.; Salton, G.: New retrieval approaches using SMART : TREC 4 (1996) 0.00

0.0014112709 = product of:
  0.012701439 = sum of:
    0.012701439 = weight(_text_:of in 7528) [ClassicSimilarity], result of:
      0.012701439 = score(doc=7528,freq=2.0), product of:
        0.061262865 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03917671 = queryNorm
        0.20732689 = fieldWeight in 7528, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.09375 = fieldNorm(doc=7528)
  0.11111111 = coord(1/9)

Imprint: Gaithersburgh, MD : National Institute of Standards and Technology

Salton, G.; Buckley, C.: Improving retrieval performance by relevance feedback (1990) 0.00
```
0.001330559 = product of:
  0.011975031 = sum of:
    0.011975031 = weight(_text_:of in 5442) [ClassicSimilarity], result of:
      0.011975031 = score(doc=5442,freq=4.0), product of:
        0.061262865 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03917671 = queryNorm
        0.19546966 = fieldWeight in 5442, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=5442)
  0.11111111 = coord(1/9)
```
Abstract

Relevance feedback is an automatic process, introduced over 20 years ago, designed to produce improved query formulations following an initial retrieval operation. The principal relevance feedback methods described over the years are examined briefly, and evaluation data are included to demonstrate the effectiveness of the various methods. Prescriptions are given for conducting text retrieval operations iteratively using relevance feedback

Source

Journal of the American Society for Information Science. 41(1990) no.4, S.288-297

Salton, G.; Allan, J.; Buckley, C.; Singhal, A.: Automatic analysis, theme generation, and summarization of machine readable texts (1994) 0.00

0.0011760591 = product of:
  0.010584532 = sum of:
    0.010584532 = weight(_text_:of in 1949) [ClassicSimilarity], result of:
      0.010584532 = score(doc=1949,freq=2.0), product of:
        0.061262865 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03917671 = queryNorm
        0.17277241 = fieldWeight in 1949, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.078125 = fieldNorm(doc=1949)
  0.11111111 = coord(1/9)

Salton, G.; Allan, J.; Singhal, A.: Automatic text decomposition and structuring (1996) 0.00

9.408473E-4 = product of:
  0.008467626 = sum of:
    0.008467626 = weight(_text_:of in 4067) [ClassicSimilarity], result of:
      0.008467626 = score(doc=4067,freq=2.0), product of:
        0.061262865 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03917671 = queryNorm
        0.13821793 = fieldWeight in 4067, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=4067)
  0.11111111 = coord(1/9)

Abstract: Sophisticated text similarity measurements are used to determine relationships between natural language text and text excerpts. The resulting linked hypertext maps can be decomposed into text segments and text theme, and these decompositions are usable to identify different text types and text structures, leading to improved text access and utilization. Gives examples of text decomposition for expository and non expository texts

Salton, G.: ¬A note about information science research (1997) 0.00

9.408473E-4 = product of:
  0.008467626 = sum of:
    0.008467626 = weight(_text_:of in 582) [ClassicSimilarity], result of:
      0.008467626 = score(doc=582,freq=2.0), product of:
        0.061262865 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03917671 = queryNorm
        0.13821793 = fieldWeight in 582, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=582)
  0.11111111 = coord(1/9)

Source: From classification to 'knowledge organization': Dorking revisited or 'past is prelude'. A collection of reprints to commemorate the firty year span between the Dorking Conference (First International Study Conference on Classification Research 1957) and the Sixth International Study Conference on Classification Research (London 1997). Ed.: A. Gilchrist

Search (15 results, page 1 of 1)

Authors

Themes