-
Kazai, G.; Lalmas, M.: The overlap problem in content-oriented XML retrieval evaluation (2004)
- Source
- SIGIR'04: Proceedings of the 27th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval. Ed.: K. Järvelin, et al.
-
Kazai, G.; Lalmas, M.; Fuhr, N.; Gövert, N.: A report on the first year of the INitiative for the Evaluation of XML Retrieval (INEX'02) (2004)
- Abstract
- The INitiative for the Evaluation of XML retrieval (INEX) aims at providing an infrastructure to evaluate the effectiveness of content-oriented XML retrieval systems. To this end, in the first round of INEX in 2002, a test collection of real world XML documents along with a set of topics and respective relevance assessments have been created with the collaboration of 36 participating organizations. In this article, we provide an overview of the first round of the INEX initiative.
- Source
- Journal of the American Society for Information Science and Technology. 55(2004) no.6, pp. 551-556
-
Rölleke, T.; Tsikrika, T.; Kazai, G.: A general matrix framework for modelling Information Retrieval (2006)
- Abstract
- In this paper, we present a well-defined general matrix framework for modelling Information Retrieval (IR). In this framework, collections, documents and queries correspond to matrix spaces. Retrieval aspects, such as content, structure and semantics, are expressed by matrices defined in these spaces and by matrix operations applied on them. The dualities of these spaces are identified through the application of frequency-based operations on the proposed matrices and through the investigation of the meaning of their eigenvectors. This allows term weighting concepts used for content-based retrieval, such as term frequency and inverse document frequency, to translate directly to concepts for structure-based retrieval. In addition, concepts such as pagerank, authorities and hubs, determined by exploiting the structural relationships between linked documents, can be defined with respect to the semantic relationships between terms. Moreover, this mathematical framework can be used to express classical and alternative evaluation measures, involving, for instance, the structure of documents, and to further explain and relate IR models and theory. The high level of reusability and abstraction of the framework leads to a logical layer for IR that makes system design and construction significantly more efficient, and thus, better and increasingly personalised systems can be built at lower costs.