Lund, K.; Burgess, C.: Producing high-dimensional semantic spaces from lexical co-occurrence (1996)
0.00
0.0031061543 = product of:
0.012424617 = sum of:
0.012424617 = weight(_text_:information in 1704) [ClassicSimilarity], result of:
0.012424617 = score(doc=1704,freq=6.0), product of:
0.06164115 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0351136 = queryNorm
0.20156369 = fieldWeight in 1704, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.046875 = fieldNorm(doc=1704)
0.25 = coord(1/4)
- Abstract
- A procedure that processes a corpus of text and produces numeric vectors containing information about its meanings for each word is presented. This procedure is applied to a large corpus of natural language text taken from Usenet, and the resulting vectors are examined to determine what information is contained within them. These vectors provide the coordinates in a high-dimensional space in which word relationships can be analyzed. Analyses of both vector similarity and multidimensional scaling demonstrate that there is significant semantic information carried in the vectors. A comparison of vector similarity with human reaction times in a single-word priming experiment is presented. These vectors provide the basis for a representational model of semantic memory, hyperspace analogue to language (HAL).