Search (9 results, page 1 of 1)

Lalmas, M.: XML retrieval (2009) 0.04
```
0.04341299 = product of:
  0.08682598 = sum of:
    0.05945961 = weight(_text_:description in 4998) [ClassicSimilarity], result of:
      0.05945961 = score(doc=4998,freq=2.0), product of:
        0.23150103 = queryWeight, product of:
          4.64937 = idf(docFreq=1149, maxDocs=44218)
          0.04979191 = queryNorm
        0.25684384 = fieldWeight in 4998, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.64937 = idf(docFreq=1149, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4998)
    0.027366372 = product of:
      0.054732744 = sum of:
        0.054732744 = weight(_text_:access in 4998) [ClassicSimilarity], result of:
          0.054732744 = score(doc=4998,freq=6.0), product of:
            0.16876608 = queryWeight, product of:
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.04979191 = queryNorm
            0.3243113 = fieldWeight in 4998, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4998)
      0.5 = coord(1/2)
  0.5 = coord(2/4)
```
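The score tree above can be checked by hand. The following is a minimal sketch, not the search engine's own code. It assumes the usual ClassicSimilarity definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the values printed above. The function and constant names are illustrative, and queryNorm and fieldNorm are copied from the output rather than derived.

```python
import math

# Values copied from the explain output above (not derived here).
QUERY_NORM = 0.04979191
MAX_DOCS = 44218


def tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, assumed to be 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, field_norm: float) -> float:
    """score = queryWeight * fieldWeight for a single term in a single document."""
    term_idf = idf(doc_freq, MAX_DOCS)
    query_weight = term_idf * QUERY_NORM
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


# First result (doc 4998): two matching terms out of four query clauses.
description = term_score(freq=2.0, doc_freq=1149, field_norm=0.0390625)
# "access" sits in a nested clause that is scaled by coord(1/2).
access = term_score(freq=6.0, doc_freq=4053, field_norm=0.0390625) * 0.5
# The outer sum is scaled by coord(2/4).
total = (description + access) * 0.5

print(f"description={description:.8f} access={access:.8f} total={total:.8f}")
```

The total should agree with the listed 0.04341299 to about six decimal places. Small differences are expected because Lucene computes in single precision and the printed queryNorm is rounded.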
Abstract

Documents usually have a content and a structure. The content refers to the text of the document, whereas the structure refers to how a document is logically organized. An increasingly common way to encode the structure is through the use of a mark-up language. Nowadays, the most widely used mark-up language for representing structure is the eXtensible Mark-up Language (XML). XML can be used to provide focused access to documents, i.e. returning XML elements, such as sections and paragraphs, instead of whole documents in response to a query. Such focused strategies are of particular benefit for information repositories containing long documents, or documents covering a wide variety of topics, where users are directed to the most relevant content within a document. The increased adoption of XML to represent document structure requires the development of tools to effectively access documents marked up in XML. This book provides a detailed description of the query languages, indexing strategies, ranking algorithms, and presentation scenarios developed to access XML documents. Major advances in XML retrieval were seen from 2002 as a result of INEX, the Initiative for Evaluation of XML Retrieval. INEX, also described in this book, provided test sets for evaluating XML retrieval effectiveness. Many of the developments and results described in this book were investigated within INEX.
Lalmas, M.; Ruthven, I.: Representing and retrieving structured documents using the Dempster-Shafer theory of evidence : modelling and evaluation (1998) 0.02
```
0.020810865 = product of:
  0.08324346 = sum of:
    0.08324346 = weight(_text_:description in 1076) [ClassicSimilarity], result of:
      0.08324346 = score(doc=1076,freq=2.0), product of:
        0.23150103 = queryWeight, product of:
          4.64937 = idf(docFreq=1149, maxDocs=44218)
          0.04979191 = queryNorm
        0.35958138 = fieldWeight in 1076, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.64937 = idf(docFreq=1149, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1076)
  0.25 = coord(1/4)
```
Abstract

Reports on a theoretical model of structured document indexing and retrieval based on the Dempster-Shafer Theory of Evidence. Includes a description of the model of structured document retrieval, the representation of structured documents, the representation of individual components, how components are combined, details of the combination process, and how relevance is captured within the model. Also presents a detailed account of an implementation of the model, and an evaluation scheme designed to test the effectiveness of the model.

Ruthven, I.; Lalmas, M.: Selective relevance feedback using term characteristics (1999) 0.02

```
0.017152525 = product of:
  0.0686101 = sum of:
    0.0686101 = weight(_text_:26 in 3824) [ClassicSimilarity], result of:
      0.0686101 = score(doc=3824,freq=2.0), product of:
        0.17584132 = queryWeight, product of:
          3.5315237 = idf(docFreq=3516, maxDocs=44218)
          0.04979191 = queryNorm
        0.3901819 = fieldWeight in 3824, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5315237 = idf(docFreq=3516, maxDocs=44218)
          0.078125 = fieldNorm(doc=3824)
  0.25 = coord(1/4)
```

Source: Vocabulary as a central concept in digital libraries: interdisciplinary concepts, challenges, and opportunities : proceedings of the Third International Conference on Conceptions of Library and Information Science (COLIS3), Dubrovnik, Croatia, 23-26 May 1999. Ed. by T. Arpanac et al

Arapakis, I.; Lalmas, M.; Ceylan, H.; Donmez, P.: Automatically embedding newsworthy links to articles : from implementation to evaluation (2014) 0.01

```
0.008576263 = product of:
  0.03430505 = sum of:
    0.03430505 = weight(_text_:26 in 1185) [ClassicSimilarity], result of:
      0.03430505 = score(doc=1185,freq=2.0), product of:
        0.17584132 = queryWeight, product of:
          3.5315237 = idf(docFreq=3516, maxDocs=44218)
          0.04979191 = queryNorm
        0.19509095 = fieldWeight in 1185, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5315237 = idf(docFreq=3516, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1185)
  0.25 = coord(1/4)
```

Date: 26. 1.2014 20:32:06

Reid, J.; Lalmas, M.; Finesilver, K.; Hertzum, M.: Best entry points for structured document retrieval : part I: characteristics (2006) 0.01
```
0.007820592 = product of:
  0.03128237 = sum of:
    0.03128237 = product of:
      0.06256474 = sum of:
        0.06256474 = weight(_text_:access in 960) [ClassicSimilarity], result of:
          0.06256474 = score(doc=960,freq=4.0), product of:
            0.16876608 = queryWeight, product of:
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.04979191 = queryNorm
            0.3707187 = fieldWeight in 960, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.0546875 = fieldNorm(doc=960)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```
Abstract

Structured document retrieval makes use of document components as the basis of the retrieval process, rather than complete documents. The inherent relationships between these components make it vital to support users' natural browsing behaviour in order to offer effective and efficient access to structured documents. This paper examines the concept of best entry points, which are document components from which the user can browse to obtain optimal access to relevant document components. In particular this paper investigates the basic characteristics of best entry points.
Reid, J.; Lalmas, M.; Finesilver, K.; Hertzum, M.: Best entry points for structured document retrieval : part II: types, usage and effectiveness (2006) 0.01
```
0.007820592 = product of:
  0.03128237 = sum of:
    0.03128237 = product of:
      0.06256474 = sum of:
        0.06256474 = weight(_text_:access in 961) [ClassicSimilarity], result of:
          0.06256474 = score(doc=961,freq=4.0), product of:
            0.16876608 = queryWeight, product of:
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.04979191 = queryNorm
            0.3707187 = fieldWeight in 961, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.0546875 = fieldNorm(doc=961)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```
Abstract

Structured document retrieval makes use of document components as the basis of the retrieval process, rather than complete documents. The inherent relationships between these components make it vital to support users' natural browsing behaviour in order to offer effective and efficient access to structured documents. This paper examines the concept of best entry points, which are document components from which the user can browse to obtain optimal access to relevant document components. It investigates the types of best entry points in structured document retrieval, and their usage and effectiveness in real information search tasks.
Piwowarski, B.; Amini, M.R.; Lalmas, M.: On using a quantum physics formalism for multidocument summarization (2012) 0.01
```
0.006841593 = product of:
  0.027366372 = sum of:
    0.027366372 = product of:
      0.054732744 = sum of:
        0.054732744 = weight(_text_:access in 236) [ClassicSimilarity], result of:
          0.054732744 = score(doc=236,freq=6.0), product of:
            0.16876608 = queryWeight, product of:
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.04979191 = queryNorm
            0.3243113 = fieldWeight in 236, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.0390625 = fieldNorm(doc=236)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```
Abstract

Multidocument summarization (MDS) aims, for each given query, to extract compressed and relevant information with respect to the different query-related themes present in a set of documents. Many approaches operate in two steps. Themes are first identified from the set, and then a summary is formed by extracting salient sentences within the different documents of each of the identified themes. Among these approaches, latent semantic analysis (LSA) based approaches rely on spectral decomposition techniques to identify the themes. In this article, we propose a major extension of these techniques that relies on the quantum information access (QIA) framework. The latter is a framework developed for modeling information access based on the probabilistic formalism of quantum physics. The QIA framework not only points out the limitations of the current LSA-based approaches, but also motivates a new principled criterion to tackle multidocument summarization that addresses these limitations. As a byproduct, it also provides a way to enhance the LSA-based approaches. Extensive experiments on the DUC 2005, 2006 and 2007 datasets show that the proposed approach consistently improves over both the LSA-based approaches and the systems that competed in the yearly DUC competitions. This demonstrates the potential impact of quantum-inspired approaches to information access in general, and of the QIA framework in particular.

Lalmas, M.: XML information retrieval (2009) 0.01

```
0.005529994 = product of:
  0.022119977 = sum of:
    0.022119977 = product of:
      0.044239953 = sum of:
        0.044239953 = weight(_text_:access in 3880) [ClassicSimilarity], result of:
          0.044239953 = score(doc=3880,freq=2.0), product of:
            0.16876608 = queryWeight, product of:
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.04979191 = queryNorm
            0.2621377 = fieldWeight in 3880, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3880)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```

Abstract

Increasingly, documents are marked up using eXtensible Mark-up Language (XML), the format standard for structured documents. In contrast to HTML, which is mainly layout-oriented, XML follows the fundamental concept of separating the logical structure of a document from its layout. This logical document structure can be exploited to allow focused access to documents, where the aim is to return the most relevant fragments within documents as answers to queries, instead of whole documents. This entry describes approaches developed to query, represent, and rank XML fragments.

Crestani, F.; Dominich, S.; Lalmas, M.; Rijsbergen, C.J.K. van: Mathematical, logical, and formal methods in information retrieval : an introduction to the special issue (2003) 0.01

```
0.005059587 = product of:
  0.020238347 = sum of:
    0.020238347 = product of:
      0.040476695 = sum of:
        0.040476695 = weight(_text_:22 in 1451) [ClassicSimilarity], result of:
          0.040476695 = score(doc=1451,freq=2.0), product of:
            0.17436278 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04979191 = queryNorm
            0.23214069 = fieldWeight in 1451, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=1451)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```

Date: 22. 3.2003 19:27:36
