-
Salton, G.; Buckley, C.: Term-weighting approaches in automatic text retrieval (1988)
0.00
9.2971756E-4 = product of:
0.013945763 = sum of:
0.009165013 = weight(_text_:in in 1938) [ClassicSimilarity], result of:
0.009165013 = score(doc=1938,freq=6.0), product of:
0.029340398 = queryWeight, product of:
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.021569785 = queryNorm
0.3123684 = fieldWeight in 1938, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.09375 = fieldNorm(doc=1938)
0.00478075 = weight(_text_:s in 1938) [ClassicSimilarity], result of:
0.00478075 = score(doc=1938,freq=4.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.20385705 = fieldWeight in 1938, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.09375 = fieldNorm(doc=1938)
0.06666667 = coord(2/30)
- Footnote
- Wiederabgedruckt in: Readings in information retrieval. Ed.: K. Sparck Jones u. P. Willett. San Francisco: Morgan Kaufmann 1997. S.323-328.
- Source
- Information processing and management. 24(1988) no.5, S.513-523
-
Salton, G.; Allan, J.; Buckley, C.; Singhal, A.: Automatic analysis, theme generation, and summarization of machine readable texts (1994)
0.00
6.813307E-4 = product of:
0.01021996 = sum of:
0.006236001 = weight(_text_:in in 1949) [ClassicSimilarity], result of:
0.006236001 = score(doc=1949,freq=4.0), product of:
0.029340398 = queryWeight, product of:
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.021569785 = queryNorm
0.21253976 = fieldWeight in 1949, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.078125 = fieldNorm(doc=1949)
0.003983958 = weight(_text_:s in 1949) [ClassicSimilarity], result of:
0.003983958 = score(doc=1949,freq=4.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.16988087 = fieldWeight in 1949, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.078125 = fieldNorm(doc=1949)
0.06666667 = coord(2/30)
- Footnote
- Wiederabgedruckt in: Readings in information retrieval. Ed.: K. Sparck Jones u. P. Willett. San Francisco: Morgan Kaufmann 1997. S.478-483.
- Source
- Science. 264(1994), S.1421-1426
-
Salton, G.; Buckley, C.; Smith, M.: On the application of syntactic methodologies in automatic text analysis (1990)
0.00
6.7448284E-4 = product of:
0.010117242 = sum of:
0.0061733257 = weight(_text_:in in 7864) [ClassicSimilarity], result of:
0.0061733257 = score(doc=7864,freq=2.0), product of:
0.029340398 = queryWeight, product of:
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.021569785 = queryNorm
0.21040362 = fieldWeight in 7864, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.109375 = fieldNorm(doc=7864)
0.003943917 = weight(_text_:s in 7864) [ClassicSimilarity], result of:
0.003943917 = score(doc=7864,freq=2.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.16817348 = fieldWeight in 7864, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.109375 = fieldNorm(doc=7864)
0.06666667 = coord(2/30)
- Source
- Information processing and management. 26(1990) no.1, S.73-92
-
Buckley, C.; Voorhees, E.M.: Retrieval system evaluation (2005)
0.00
6.7448284E-4 = product of:
0.010117242 = sum of:
0.0061733257 = weight(_text_:in in 648) [ClassicSimilarity], result of:
0.0061733257 = score(doc=648,freq=2.0), product of:
0.029340398 = queryWeight, product of:
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.021569785 = queryNorm
0.21040362 = fieldWeight in 648, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.109375 = fieldNorm(doc=648)
0.003943917 = weight(_text_:s in 648) [ClassicSimilarity], result of:
0.003943917 = score(doc=648,freq=2.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.16817348 = fieldWeight in 648, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.109375 = fieldNorm(doc=648)
0.06666667 = coord(2/30)
- Pages
- S.53-78
- Source
- TREC: experiment and evaluation in information retrieval. Ed.: E.M. Voorhees, u. D.K. Harman
-
Buckley, C.; Voorhees, E.M.: Retrieval evaluation with incomplete information (2004)
0.00
5.781282E-4 = product of:
0.008671923 = sum of:
0.0052914224 = weight(_text_:in in 4127) [ClassicSimilarity], result of:
0.0052914224 = score(doc=4127,freq=2.0), product of:
0.029340398 = queryWeight, product of:
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.021569785 = queryNorm
0.18034597 = fieldWeight in 4127, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.09375 = fieldNorm(doc=4127)
0.0033805002 = weight(_text_:s in 4127) [ClassicSimilarity], result of:
0.0033805002 = score(doc=4127,freq=2.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.14414869 = fieldWeight in 4127, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.09375 = fieldNorm(doc=4127)
0.06666667 = coord(2/30)
- Pages
- S.25-32
- Source
- SIGIR'04: Proceedings of the 27th Annual International ACM-SIGIR Conference an Research and Development in Information Retrieval. Ed.: K. Järvelin, u.a
-
Buckley, C.: ¬The SMART Project at TREC (2005)
0.00
5.781282E-4 = product of:
0.008671923 = sum of:
0.0052914224 = weight(_text_:in in 5088) [ClassicSimilarity], result of:
0.0052914224 = score(doc=5088,freq=2.0), product of:
0.029340398 = queryWeight, product of:
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.021569785 = queryNorm
0.18034597 = fieldWeight in 5088, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.09375 = fieldNorm(doc=5088)
0.0033805002 = weight(_text_:s in 5088) [ClassicSimilarity], result of:
0.0033805002 = score(doc=5088,freq=2.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.14414869 = fieldWeight in 5088, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.09375 = fieldNorm(doc=5088)
0.06666667 = coord(2/30)
- Pages
- S.301-320
- Source
- TREC: experiment and evaluation in information retrieval. Ed.: E.M. Voorhees, u. D.K. Harman
-
Salton, G.; Buckley, C.; Allan, J.: Automatic structuring of text files (1992)
0.00
5.575784E-4 = product of:
0.008363675 = sum of:
0.006110009 = weight(_text_:in in 6507) [ClassicSimilarity], result of:
0.006110009 = score(doc=6507,freq=6.0), product of:
0.029340398 = queryWeight, product of:
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.021569785 = queryNorm
0.2082456 = fieldWeight in 6507, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.0625 = fieldNorm(doc=6507)
0.002253667 = weight(_text_:s in 6507) [ClassicSimilarity], result of:
0.002253667 = score(doc=6507,freq=2.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.09609913 = fieldWeight in 6507, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0625 = fieldNorm(doc=6507)
0.06666667 = coord(2/30)
- Abstract
- In many practical information retrieval situations, it is necessary to process heterogeneous text databases that vary greatly in scope and coverage and deal with many different subjects. In such an environment it is important to provide flexible access to individual text pieces and to structure the collection so that related text elements are identified and properly linked. Describes methods for the automatic structuring of heterogeneous text collections and the construction of browsing tools and access procedures that facilitate collection use. Illustrates these emthods with searches using a large automated encyclopedia
- Source
- Electronic publishing. 5(1992) no.1, S.1-17
-
Salton, G.; Buckley, C.: Approaches to global text analysis (1990)
0.00
5.43019E-4 = product of:
0.008145284 = sum of:
0.0061733257 = weight(_text_:in in 4901) [ClassicSimilarity], result of:
0.0061733257 = score(doc=4901,freq=8.0), product of:
0.029340398 = queryWeight, product of:
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.021569785 = queryNorm
0.21040362 = fieldWeight in 4901, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.0546875 = fieldNorm(doc=4901)
0.0019719584 = weight(_text_:s in 4901) [ClassicSimilarity], result of:
0.0019719584 = score(doc=4901,freq=2.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.08408674 = fieldWeight in 4901, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0546875 = fieldNorm(doc=4901)
0.06666667 = coord(2/30)
- Abstract
- Current approaches to the analysis of natural language text are not viable for documents of unrestricted scope. A global text analysis system is proposed designed to identify homogeneous text environments in which the meaning of text words and phrases remains unambiguous, and useful term relationships may be automatically determined. The proposed methods include document clustering methods, as well as comparisons of local document excerpts in specified global contexts, leading to structured text representations in which similar texts, or text excerpts, are appropriately linked
- Pages
- S.228-233
- Source
- ASIS'90: Information in the year 2000, from research to applications. Proc. of the 53rd Annual Meeting of the American Society for Information Science, Toronto, Canada, 4.-8.11.1990. Ed. by Diana Henderson
-
Buckley, C.; Allan, J.; Salton, G.: Automatic routing and retrieval using Smart : TREC-2 (1995)
0.00
4.181838E-4 = product of:
0.0062727565 = sum of:
0.0045825066 = weight(_text_:in in 5699) [ClassicSimilarity], result of:
0.0045825066 = score(doc=5699,freq=6.0), product of:
0.029340398 = queryWeight, product of:
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.021569785 = queryNorm
0.1561842 = fieldWeight in 5699, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.046875 = fieldNorm(doc=5699)
0.0016902501 = weight(_text_:s in 5699) [ClassicSimilarity], result of:
0.0016902501 = score(doc=5699,freq=2.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.072074346 = fieldWeight in 5699, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.046875 = fieldNorm(doc=5699)
0.06666667 = coord(2/30)
- Abstract
- The Smart information retrieval project emphazises completely automatic approaches to the understanding and retrieval of large quantities of text. The work in the TREC-2 environment continues, performing both routing and ad hoc experiments. The ad hoc work extends investigations into combining global similarities, giving an overall indication of how a document matches a query, with local similarities identifying a smaller part of the document that matches the query. The performance of ad hoc runs is good, but it is clear that full advantage of the available local information is not been taken advantage of. The routing experiments use conventional relevance feedback approaches to routing, but with a much greater degree of query expansion than was previously done. The length of a query vector is increased by a factor of 5 to 10 by adding terms found in previously seen relevant documents. This approach improves effectiveness by 30-40% over the original query
- Source
- Information processing and management. 31(1995) no.3, S.315-326
- Theme
- Semantisches Umfeld in Indexierung u. Retrieval
-
Salton, G.; Buckley, C.: Parallel text search methods (1988)
0.00
1.5024448E-4 = product of:
0.004507334 = sum of:
0.004507334 = weight(_text_:s in 404) [ClassicSimilarity], result of:
0.004507334 = score(doc=404,freq=2.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.19219826 = fieldWeight in 404, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.125 = fieldNorm(doc=404)
0.033333335 = coord(1/30)
- Source
- Communications of the Association for Computing Machinery. 31(1988), S.205-215
-
Salton, G.; Allen, J.; Buckley, C.; Singhal, A.: Automatic analysis, theme generation, and summarization of machine-readable data (1994)
0.00
1.3146391E-4 = product of:
0.003943917 = sum of:
0.003943917 = weight(_text_:s in 1168) [ClassicSimilarity], result of:
0.003943917 = score(doc=1168,freq=2.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.16817348 = fieldWeight in 1168, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.109375 = fieldNorm(doc=1168)
0.033333335 = coord(1/30)
- Source
- Science. 264(1994) no.5164, S.1421-1426
-
Buckley, C.; Singhal, A.; Mitra, M.; Salton, G.: New retrieval approaches using SMART : TREC 4 (1996)
0.00
1.12683345E-4 = product of:
0.0033805002 = sum of:
0.0033805002 = weight(_text_:s in 7528) [ClassicSimilarity], result of:
0.0033805002 = score(doc=7528,freq=2.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.14414869 = fieldWeight in 7528, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.09375 = fieldNorm(doc=7528)
0.033333335 = coord(1/30)
- Pages
- S.25-48
-
Singhal, A.; Buckley, C.; Mitra, M.: Using query zoning and correlation with SMART : TREC 5 (1997)
0.00
1.12683345E-4 = product of:
0.0033805002 = sum of:
0.0033805002 = weight(_text_:s in 3090) [ClassicSimilarity], result of:
0.0033805002 = score(doc=3090,freq=2.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.14414869 = fieldWeight in 3090, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.09375 = fieldNorm(doc=3090)
0.033333335 = coord(1/30)
- Pages
- S.25-48
-
Wong, S.K.M.; Yao, Y.Y.; Salton, G.; Buckley, C.: Evaluation of an adaptive linear model (1991)
0.00
7.512224E-5 = product of:
0.002253667 = sum of:
0.002253667 = weight(_text_:s in 4836) [ClassicSimilarity], result of:
0.002253667 = score(doc=4836,freq=2.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.09609913 = fieldWeight in 4836, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0625 = fieldNorm(doc=4836)
0.033333335 = coord(1/30)
- Source
- Journal of the American Society for Information Science. 42(1991) no.10, S.723-730
-
Salton, G.; Buckley, C.: Improving retrieval performance by relevance feedback (1990)
0.00
7.512224E-5 = product of:
0.002253667 = sum of:
0.002253667 = weight(_text_:s in 5442) [ClassicSimilarity], result of:
0.002253667 = score(doc=5442,freq=2.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.09609913 = fieldWeight in 5442, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0625 = fieldNorm(doc=5442)
0.033333335 = coord(1/30)
- Source
- Journal of the American Society for Information Science. 41(1990) no.4, S.288-297