Search (4 results, page 1 of 1)

Buckley, C.; Allan, J.; Salton, G.: Automatic routing and retrieval using Smart : TREC-2 (1995) 0.00
```
0.003091229 = product of:
  0.012364916 = sum of:
    0.012364916 = weight(_text_:information in 5699) [ClassicSimilarity], result of:
      0.012364916 = score(doc=5699,freq=6.0), product of:
        0.06134496 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.034944877 = queryNorm
        0.20156369 = fieldWeight in 5699, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=5699)
  0.25 = coord(1/4)
```
Abstract

The Smart information retrieval project emphasizes completely automatic approaches to the understanding and retrieval of large quantities of text. The work in the TREC-2 environment continues, performing both routing and ad hoc experiments. The ad hoc work extends investigations into combining global similarities, giving an overall indication of how a document matches a query, with local similarities identifying a smaller part of the document that matches the query. The performance of ad hoc runs is good, but it is clear that the available local information is not being used to full advantage. The routing experiments use conventional relevance feedback approaches to routing, but with a much greater degree of query expansion than was previously done. The length of a query vector is increased by a factor of 5 to 10 by adding terms found in previously seen relevant documents. This approach improves effectiveness by 30-40% over the original query.

Source

Information processing and management. 31(1995) no.3, S.315-326

Salton, G.; Allan, J.; Buckley, C.; Singhal, A.: Automatic analysis, theme generation, and summarization of machine readable texts (1994) 0.00

```
0.0029745363 = product of:
  0.011898145 = sum of:
    0.011898145 = weight(_text_:information in 1949) [ClassicSimilarity], result of:
      0.011898145 = score(doc=1949,freq=2.0), product of:
        0.06134496 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.034944877 = queryNorm
        0.19395474 = fieldWeight in 1949, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.078125 = fieldNorm(doc=1949)
  0.25 = coord(1/4)
```

Footnote: Reprinted in: Readings in information retrieval. Ed.: K. Sparck Jones and P. Willett. San Francisco: Morgan Kaufmann 1997. S.478-483.

Salton, G.; Buckley, C.; Allan, J.: Automatic structuring of text files (1992) 0.00
```
0.002379629 = product of:
  0.009518516 = sum of:
    0.009518516 = weight(_text_:information in 6507) [ClassicSimilarity], result of:
      0.009518516 = score(doc=6507,freq=2.0), product of:
        0.06134496 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.034944877 = queryNorm
        0.1551638 = fieldWeight in 6507, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0625 = fieldNorm(doc=6507)
  0.25 = coord(1/4)
```
Abstract

In many practical information retrieval situations, it is necessary to process heterogeneous text databases that vary greatly in scope and coverage and deal with many different subjects. In such an environment it is important to provide flexible access to individual text pieces and to structure the collection so that related text elements are identified and properly linked. Describes methods for the automatic structuring of heterogeneous text collections and the construction of browsing tools and access procedures that facilitate collection use. Illustrates these methods with searches using a large automated encyclopedia.

Salton, G.; Allan, J.; Singhal, A.: Automatic text decomposition and structuring (1996) 0.00

```
0.002379629 = product of:
  0.009518516 = sum of:
    0.009518516 = weight(_text_:information in 4067) [ClassicSimilarity], result of:
      0.009518516 = score(doc=4067,freq=2.0), product of:
        0.06134496 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.034944877 = queryNorm
        0.1551638 = fieldWeight in 4067, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0625 = fieldNorm(doc=4067)
  0.25 = coord(1/4)
```

Source: Information processing and management. 32(1996) no.2, S.127-138
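
Note on the scores: every explain tree in this listing multiplies the same four factors. The minimal sketch below recomputes one of them, assuming the standard Lucene ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the tf and idf values shown. The function name `classic_term_score` is hypothetical, and queryNorm, fieldNorm, and coord are copied from the listing rather than derived.

```python
import math


def classic_term_score(freq, doc_freq, max_docs, query_norm, field_norm, coord):
    """Recompute a single-term score the way the explain trees lay it out.

    Hypothetical helper for illustration; not part of any library.
    """
    tf = math.sqrt(freq)                             # tf(freq) = sqrt(termFreq)
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))  # idf(docFreq, maxDocs)
    query_weight = idf * query_norm                  # "queryWeight" node
    field_weight = tf * idf * field_norm             # "fieldWeight" node
    return query_weight * field_weight * coord       # scaled by coord(1/4)


# Values taken from the first result (doc=5699).
score = classic_term_score(
    freq=6.0,
    doc_freq=20772,
    max_docs=44218,
    query_norm=0.034944877,
    field_norm=0.046875,
    coord=1 / 4,
)
print(f"{score:.9f}")
```

The result should agree with the listed 0.003091229 up to floating-point rounding, since the index stores single-precision values. Because idf and queryNorm are identical across all four results, the ranking here is driven only by termFreq and fieldNorm.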
