Search (415 results, page 1 of 21)

Xiong, C.: Knowledge based text representations for information retrieval (2016) 0.57

0.5686467 = product of:
  0.7108084 = sum of:
    0.032562904 = product of:
      0.097688705 = sum of:
        0.097688705 = weight(_text_:3a in 5820) [ClassicSimilarity], result of:
          0.097688705 = score(doc=5820,freq=2.0), product of:
            0.2607266 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.030753274 = queryNorm
            0.3746787 = fieldWeight in 5820, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.03125 = fieldNorm(doc=5820)
      0.33333334 = coord(1/3)
    0.13815269 = weight(_text_:2f in 5820) [ClassicSimilarity], result of:
      0.13815269 = score(doc=5820,freq=4.0), product of:
        0.2607266 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.030753274 = queryNorm
        0.5298757 = fieldWeight in 5820, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.03125 = fieldNorm(doc=5820)
    0.13815269 = weight(_text_:2f in 5820) [ClassicSimilarity], result of:
      0.13815269 = score(doc=5820,freq=4.0), product of:
        0.2607266 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.030753274 = queryNorm
        0.5298757 = fieldWeight in 5820, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.03125 = fieldNorm(doc=5820)
    0.011846555 = weight(_text_:information in 5820) [ClassicSimilarity], result of:
      0.011846555 = score(doc=5820,freq=16.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.21943474 = fieldWeight in 5820, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03125 = fieldNorm(doc=5820)
    0.024872115 = weight(_text_:retrieval in 5820) [ClassicSimilarity], result of:
      0.024872115 = score(doc=5820,freq=8.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.26736724 = fieldWeight in 5820, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.03125 = fieldNorm(doc=5820)
    0.13815269 = weight(_text_:2f in 5820) [ClassicSimilarity], result of:
      0.13815269 = score(doc=5820,freq=4.0), product of:
        0.2607266 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.030753274 = queryNorm
        0.5298757 = fieldWeight in 5820, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.03125 = fieldNorm(doc=5820)
    0.088916 = weight(_text_:ranking in 5820) [ClassicSimilarity], result of:
      0.088916 = score(doc=5820,freq=10.0), product of:
        0.16634533 = queryWeight, product of:
          5.4090285 = idf(docFreq=537, maxDocs=44218)
          0.030753274 = queryNorm
        0.5345266 = fieldWeight in 5820, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          5.4090285 = idf(docFreq=537, maxDocs=44218)
          0.03125 = fieldNorm(doc=5820)
    0.13815269 = weight(_text_:2f in 5820) [ClassicSimilarity], result of:
      0.13815269 = score(doc=5820,freq=4.0), product of:
        0.2607266 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.030753274 = queryNorm
        0.5298757 = fieldWeight in 5820, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.03125 = fieldNorm(doc=5820)
  0.8 = coord(8/10)

Abstract: The successes of information retrieval (IR) in recent decades were built upon bag-of-words representations. Effective as it is, bag-of-words is only a shallow text understanding; there is a limited amount of information for document ranking in the word space. This dissertation goes beyond words and builds knowledge based text representations, which embed the external and carefully curated information from knowledge bases, and provide richer and structured evidence for more advanced information retrieval systems. This thesis research first builds query representations with entities associated with the query. Entities' descriptions are used by query expansion techniques that enrich the query with explanation terms. Then we present a general framework that represents a query with entities that appear in the query, are retrieved by the query, or frequently show up in the top retrieved documents. A latent space model is developed to jointly learn the connections from query to entities and the ranking of documents, modeling the external evidence from knowledge bases and internal ranking features cooperatively. To further improve the quality of relevant entities, a defining factor of our query representations, we introduce learning to rank to entity search and retrieve better entities from knowledge bases. In the document representation part, this thesis research also moves one step forward with a bag-of-entities model, in which documents are represented by their automatic entity annotations, and the ranking is performed in the entity space.
This proposal includes plans to improve the quality of relevant entities with a co-learning framework that learns from both entity labels and document labels. We also plan to develop a hybrid ranking system that combines word based and entity based representations together with their uncertainties considered. At last, we plan to enrich the text representations with connections between entities. We propose several ways to infer entity graph representations for texts, and to rank documents using their structure representations. This dissertation overcomes the limitation of word based representations with external and carefully curated information from knowledge bases. We believe this thesis research is a solid start towards the new generation of intelligent, semantic, and structured information retrieval.
Content: Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Language and Information Technologies. Vgl.: https%3A%2F%2Fwww.cs.cmu.edu%2F~cx%2Fpapers%2Fknowledge_based_text_representation.pdf&usg=AOvVaw0SaTSvhWLTh__Uz_HtOtl3.

Stojanovic, N.: Ontology-based Information Retrieval : methods and tools for cooperative query answering (2005) 0.40

0.39945948 = product of:
  0.49932435 = sum of:
    0.032562904 = product of:
      0.097688705 = sum of:
        0.097688705 = weight(_text_:3a in 701) [ClassicSimilarity], result of:
          0.097688705 = score(doc=701,freq=2.0), product of:
            0.2607266 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.030753274 = queryNorm
            0.3746787 = fieldWeight in 701, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.03125 = fieldNorm(doc=701)
      0.33333334 = coord(1/3)
    0.097688705 = weight(_text_:2f in 701) [ClassicSimilarity], result of:
      0.097688705 = score(doc=701,freq=2.0), product of:
        0.2607266 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.030753274 = queryNorm
        0.3746787 = fieldWeight in 701, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.03125 = fieldNorm(doc=701)
    0.097688705 = weight(_text_:2f in 701) [ClassicSimilarity], result of:
      0.097688705 = score(doc=701,freq=2.0), product of:
        0.2607266 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.030753274 = queryNorm
        0.3746787 = fieldWeight in 701, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.03125 = fieldNorm(doc=701)
    0.012565169 = weight(_text_:information in 701) [ClassicSimilarity], result of:
      0.012565169 = score(doc=701,freq=18.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.23274568 = fieldWeight in 701, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03125 = fieldNorm(doc=701)
    0.046531465 = weight(_text_:retrieval in 701) [ClassicSimilarity], result of:
      0.046531465 = score(doc=701,freq=28.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.5001983 = fieldWeight in 701, product of:
          5.2915025 = tf(freq=28.0), with freq of:
            28.0 = termFreq=28.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.03125 = fieldNorm(doc=701)
    0.097688705 = weight(_text_:2f in 701) [ClassicSimilarity], result of:
      0.097688705 = score(doc=701,freq=2.0), product of:
        0.2607266 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.030753274 = queryNorm
        0.3746787 = fieldWeight in 701, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.03125 = fieldNorm(doc=701)
    0.097688705 = weight(_text_:2f in 701) [ClassicSimilarity], result of:
      0.097688705 = score(doc=701,freq=2.0), product of:
        0.2607266 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.030753274 = queryNorm
        0.3746787 = fieldWeight in 701, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.03125 = fieldNorm(doc=701)
    0.01690999 = product of:
      0.03381998 = sum of:
        0.03381998 = weight(_text_:evaluation in 701) [ClassicSimilarity], result of:
          0.03381998 = score(doc=701,freq=4.0), product of:
            0.12900078 = queryWeight, product of:
              4.1947007 = idf(docFreq=1811, maxDocs=44218)
              0.030753274 = queryNorm
            0.2621688 = fieldWeight in 701, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.1947007 = idf(docFreq=1811, maxDocs=44218)
              0.03125 = fieldNorm(doc=701)
      0.5 = coord(1/2)
  0.8 = coord(8/10)

Abstract: By the explosion of possibilities for a ubiquitous content production, the information overload problem reaches the level of complexity which cannot be managed by traditional modelling approaches anymore. Due to their pure syntactical nature traditional information retrieval approaches did not succeed in treating content itself (i.e. its meaning, and not its representation). This leads to a very low usefulness of the results of a retrieval process for a user's task at hand. In the last ten years ontologies have been emerged from an interesting conceptualisation paradigm to a very promising (semantic) modelling technology, especially in the context of the Semantic Web. From the information retrieval point of view, ontologies enable a machine-understandable form of content description, such that the retrieval process can be driven by the meaning of the content. However, the very ambiguous nature of the retrieval process in which a user, due to the unfamiliarity with the underlying repository and/or query syntax, just approximates his information need in a query, implies a necessity to include the user in the retrieval process more actively in order to close the gap between the meaning of the content and the meaning of a user's query (i.e. his information need). This thesis lays foundation for such an ontology-based interactive retrieval process, in which the retrieval system interacts with a user in order to conceptually interpret the meaning of his query, whereas the underlying domain ontology drives the conceptualisation process. In that way the retrieval process evolves from a query evaluation process into a highly interactive cooperation between a user and the retrieval system, in which the system tries to anticipate the user's information need and to deliver the relevant content proactively. Moreover, the notion of content relevance for a user's query evolves from a content dependent artefact to the multidimensional context-dependent structure, strongly influenced by the user's preferences. This cooperation process is realized as the so-called Librarian Agent Query Refinement Process. In order to clarify the impact of an ontology on the retrieval process (regarding its complexity and quality), a set of methods and tools for different levels of content and query formalisation is developed, ranging from pure ontology-based inferencing to keyword-based querying in which semantics automatically emerges from the results. Our evaluation studies have shown that the possibilities to conceptualize a user's information need in the right manner and to interpret the retrieval results accordingly are key issues for realizing much more meaningful information retrieval systems.
Content: Vgl.: http%3A%2F%2Fdigbib.ubka.uni-karlsruhe.de%2Fvolltexte%2Fdocuments%2F1627&ei=tAtYUYrBNoHKtQb3l4GYBw&usg=AFQjCNHeaxKkKU3-u54LWxMNYGXaaDLCGw&sig2=8WykXWQoDKjDSdGtAakH2Q&bvm=bv.44442042,d.Yms.

Zeng, Q.; Yu, M.; Yu, W.; Xiong, J.; Shi, Y.; Jiang, M.: Faceted hierarchy : a new graph type to organize scientific concepts and a construction method (2019) 0.38

0.38475552 = product of:
  0.6412592 = sum of:
    0.048844352 = product of:
      0.14653306 = sum of:
        0.14653306 = weight(_text_:3a in 400) [ClassicSimilarity], result of:
          0.14653306 = score(doc=400,freq=2.0), product of:
            0.2607266 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.030753274 = queryNorm
            0.56201804 = fieldWeight in 400, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.046875 = fieldNorm(doc=400)
      0.33333334 = coord(1/3)
    0.14653306 = weight(_text_:2f in 400) [ClassicSimilarity], result of:
      0.14653306 = score(doc=400,freq=2.0), product of:
        0.2607266 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.030753274 = queryNorm
        0.56201804 = fieldWeight in 400, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=400)
    0.14653306 = weight(_text_:2f in 400) [ClassicSimilarity], result of:
      0.14653306 = score(doc=400,freq=2.0), product of:
        0.2607266 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.030753274 = queryNorm
        0.56201804 = fieldWeight in 400, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=400)
    0.0062825847 = weight(_text_:information in 400) [ClassicSimilarity], result of:
      0.0062825847 = score(doc=400,freq=2.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.116372846 = fieldWeight in 400, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=400)
    0.14653306 = weight(_text_:2f in 400) [ClassicSimilarity], result of:
      0.14653306 = score(doc=400,freq=2.0), product of:
        0.2607266 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.030753274 = queryNorm
        0.56201804 = fieldWeight in 400, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=400)
    0.14653306 = weight(_text_:2f in 400) [ClassicSimilarity], result of:
      0.14653306 = score(doc=400,freq=2.0), product of:
        0.2607266 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.030753274 = queryNorm
        0.56201804 = fieldWeight in 400, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=400)
  0.6 = coord(6/10)

Abstract: On a scientific concept hierarchy, a parent concept may have a few attributes, each of which has multiple values being a group of child concepts. We call these attributes facets: classification has a few facets such as application (e.g., face recognition), model (e.g., svm, knn), and metric (e.g., precision). In this work, we aim at building faceted concept hierarchies from scientific literature. Hierarchy construction methods heavily rely on hypernym detection, however, the faceted relations are parent-to-child links but the hypernym relation is a multi-hop, i.e., ancestor-to-descendent link with a specific facet "type-of". We use information extraction techniques to find synonyms, sibling concepts, and ancestor-descendent relations from a data science corpus. And we propose a hierarchy growth algorithm to infer the parent-child links from the three types of relationships. It resolves conflicts by maintaining the acyclic structure of a hierarchy.
Content: Vgl.: https%3A%2F%2Faclanthology.org%2FD19-5317.pdf&usg=AOvVaw0ZZFyq5wWTtNTvNkrvjlGA.

Thenmalar, S.; Geetha, T.V.: Enhanced ontology-based indexing and searching (2014) 0.04
```
0.03870782 = product of:
  0.09676954 = sum of:
    0.010365736 = weight(_text_:information in 1633) [ClassicSimilarity], result of:
      0.010365736 = score(doc=1633,freq=16.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.1920054 = fieldWeight in 1633, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.02734375 = fieldNorm(doc=1633)
    0.018847398 = weight(_text_:retrieval in 1633) [ClassicSimilarity], result of:
      0.018847398 = score(doc=1633,freq=6.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.20260347 = fieldWeight in 1633, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.02734375 = fieldNorm(doc=1633)
    0.06026478 = weight(_text_:ranking in 1633) [ClassicSimilarity], result of:
      0.06026478 = score(doc=1633,freq=6.0), product of:
        0.16634533 = queryWeight, product of:
          5.4090285 = idf(docFreq=537, maxDocs=44218)
          0.030753274 = queryNorm
        0.3622872 = fieldWeight in 1633, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          5.4090285 = idf(docFreq=537, maxDocs=44218)
          0.02734375 = fieldNorm(doc=1633)
    0.007291627 = product of:
      0.014583254 = sum of:
        0.014583254 = weight(_text_:22 in 1633) [ClassicSimilarity], result of:
          0.014583254 = score(doc=1633,freq=2.0), product of:
            0.107692726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.030753274 = queryNorm
            0.1354154 = fieldWeight in 1633, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02734375 = fieldNorm(doc=1633)
      0.5 = coord(1/2)
  0.4 = coord(4/10)
```
Abstract

Purpose - The purpose of this paper is to improve the conceptual-based search by incorporating structural ontological information such as concepts and relations. Generally, Semantic-based information retrieval aims to identify relevant information based on the meanings of the query terms or on the context of the terms and the performance of semantic information retrieval is carried out through standard measures-precision and recall. Higher precision leads to the (meaningful) relevant documents obtained and lower recall leads to the less coverage of the concepts. Design/methodology/approach - In this paper, the authors enhance the existing ontology-based indexing proposed by Kohler et al., by incorporating sibling information to the index. The index designed by Kohler et al., contains only super and sub-concepts from the ontology. In addition, in our approach, we focus on two tasks; query expansion and ranking of the expanded queries, to improve the efficiency of the ontology-based search. The aforementioned tasks make use of ontological concepts, and relations existing between those concepts so as to obtain semantically more relevant search results for a given query. Findings - The proposed ontology-based indexing technique is investigated by analysing the coverage of concepts that are being populated in the index. Here, we introduce a new measure called index enhancement measure, to estimate the coverage of ontological concepts being indexed. We have evaluated the ontology-based search for the tourism domain with the tourism documents and tourism-specific ontology. The comparison of search results based on the use of ontology "with and without query expansion" is examined to estimate the efficiency of the proposed query expansion task. The ranking is compared with the ORank system to evaluate the performance of our ontology-based search. From these analyses, the ontology-based search results shows better recall when compared to the other concept-based search systems. The mean average precision of the ontology-based search is found to be 0.79 and the recall is found to be 0.65, the ORank system has the mean average precision of 0.62 and the recall is found to be 0.51, while the concept-based search has the mean average precision of 0.56 and the recall is found to be 0.42. Practical implications - When the concept is not present in the domain-specific ontology, the concept cannot be indexed. When the given query term is not available in the ontology then the term-based results are retrieved. Originality/value - In addition to super and sub-concepts, we incorporate the concepts present in same level (siblings) to the ontological index. The structural information from the ontology is determined for the query expansion. The ranking of the documents depends on the type of the query (single concept query, multiple concept queries and concept with relation queries) and the ontological relations that exists in the query and the documents. With this ontological structural information, the search results showed us better coverage of concepts with respect to the query.

Date

20. 1.2015 18:30:22

Source

Aslib journal of information management. 66(2014) no.6, S.678-696

Theme

Semantisches Umfeld in Indexierung u. Retrieval

Lee, J.; Min, J.-K.; Oh, A.; Chung, C.-W.: Effective ranking and search techniques for Web resources considering semantic relationships (2014) 0.04

0.035564274 = product of:
  0.11854757 = sum of:
    0.010470974 = weight(_text_:information in 2670) [ClassicSimilarity], result of:
      0.010470974 = score(doc=2670,freq=8.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.19395474 = fieldWeight in 2670, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2670)
    0.02198405 = weight(_text_:retrieval in 2670) [ClassicSimilarity], result of:
      0.02198405 = score(doc=2670,freq=4.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.23632148 = fieldWeight in 2670, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2670)
    0.08609255 = weight(_text_:ranking in 2670) [ClassicSimilarity], result of:
      0.08609255 = score(doc=2670,freq=6.0), product of:
        0.16634533 = queryWeight, product of:
          5.4090285 = idf(docFreq=537, maxDocs=44218)
          0.030753274 = queryNorm
        0.51755315 = fieldWeight in 2670, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          5.4090285 = idf(docFreq=537, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2670)
  0.3 = coord(3/10)

Abstract: On the Semantic Web, the types of resources and the semantic relationships between resources are defined in an ontology. By using that information, the accuracy of information retrieval can be improved. In this paper, we present effective ranking and search techniques considering the semantic relationships in an ontology. Our technique retrieves top-k resources which are the most relevant to query keywords through the semantic relationships. To do this, we propose a weighting measure for the semantic relationship. Based on this measure, we propose a novel ranking method which considers the number of meaningful semantic relationships between a resource and keywords as well as the coverage and discriminating power of keywords. In order to improve the efficiency of the search, we prune the unnecessary search space using the length and weight thresholds of the semantic relationship path. In addition, we exploit Threshold Algorithm based on an extended inverted index to answer top-k results efficiently. The experimental results using real data sets demonstrate that our retrieval method using the semantic information generates accurate results efficiently compared to the traditional methods.
Source: Information processing and management. 50(2014) no.1, S.132-155

Vallet, D.; Fernández, M.; Castells, P.: ¬An ontology-based information retrieval model (2005) 0.03

0.030971227 = product of:
  0.10323742 = sum of:
    0.0062825847 = weight(_text_:information in 4708) [ClassicSimilarity], result of:
      0.0062825847 = score(doc=4708,freq=2.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.116372846 = fieldWeight in 4708, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=4708)
    0.03730817 = weight(_text_:retrieval in 4708) [ClassicSimilarity], result of:
      0.03730817 = score(doc=4708,freq=8.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.40105087 = fieldWeight in 4708, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=4708)
    0.059646662 = weight(_text_:ranking in 4708) [ClassicSimilarity], result of:
      0.059646662 = score(doc=4708,freq=2.0), product of:
        0.16634533 = queryWeight, product of:
          5.4090285 = idf(docFreq=537, maxDocs=44218)
          0.030753274 = queryNorm
        0.35857132 = fieldWeight in 4708, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.4090285 = idf(docFreq=537, maxDocs=44218)
          0.046875 = fieldNorm(doc=4708)
  0.3 = coord(3/10)

Abstract: Semantic search has been one of the motivations of the Semantic Web since it was envisioned. We propose a model for the exploitation of ontologybased KBs to improve search over large document repositories. Our approach includes an ontology-based scheme for the semi-automatic annotation of documents, and a retrieval system. The retrieval model is based on an adaptation of the classic vector-space model, including an annotation weighting algorithm, and a ranking algorithm. Semantic search is combined with keyword-based search to achieve tolerance to KB incompleteness. Our proposal is illustrated with sample experiments showing improvements with respect to keyword-based search, and providing ground for further research and discussion.
Theme: Semantisches Umfeld in Indexierung u. Retrieval

Maheswari, J.U.; Karpagam, G.R.: ¬A conceptual framework for ontology based information retrieval (2010) 0.03
```
0.0291869 = product of:
  0.09728967 = sum of:
    0.012824273 = weight(_text_:information in 702) [ClassicSimilarity], result of:
      0.012824273 = score(doc=702,freq=12.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.23754507 = fieldWeight in 702, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=702)
    0.034759834 = weight(_text_:retrieval in 702) [ClassicSimilarity], result of:
      0.034759834 = score(doc=702,freq=10.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.37365708 = fieldWeight in 702, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0390625 = fieldNorm(doc=702)
    0.049705554 = weight(_text_:ranking in 702) [ClassicSimilarity], result of:
      0.049705554 = score(doc=702,freq=2.0), product of:
        0.16634533 = queryWeight, product of:
          5.4090285 = idf(docFreq=537, maxDocs=44218)
          0.030753274 = queryNorm
        0.29880944 = fieldWeight in 702, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.4090285 = idf(docFreq=537, maxDocs=44218)
          0.0390625 = fieldNorm(doc=702)
  0.3 = coord(3/10)
```
Abstract

Improving Information retrieval by employing the use of ontologies to overcome the limitations of syntactic search has been one of the inspirations since its emergence. This paper proposes a conceptual framework to exploit ontology based Information retrieval. This framework constitutes of five phases namely Query parsing, word stemming, ontology matching, weight assignment, ranking and Information retrieval. In the first phase, the user query is parsed into sequence of words. The parsed contents are curtailed to identify the significant word by ignoring superfluous terms such as "to", "is","ed", "about" and the like in the stemming phase. The objective of the stemming phase is to throttle feature descriptors to root words, which in turn will increase efficiency; this reduces the time consumed for searching the superfluous terms, which may not significantly influence the effectiveness of the retrieval process. In the third phase ontology matching is carried out by matching the parsed words with the relevant terms in the existing ontology. If the ontology does not exist, it is recommended to generate the required ontology. In the fourth phase the weights are assigned based on the distance between the stemmed words and the terms in the ontology uses improved matchmaking algorithm. The range of weights varies from 0 to 1 based on the level of distance in the ontology (superclass-subclass). The aggregate weights are calculated for the all the combination of stemmed words. The combination with the highest score is ranked as the best and the corresponding information is retrieved. The conceptual workflow is illustrated with an e-governance case study Academic Information System.

Renear, A.H.; Wickett, K.M.; Urban, R.J.; Dubin, D.; Shreeves, S.L.: Collection/item metadata relationships (2008) 0.03

0.028841147 = product of:
  0.09613715 = sum of:
    0.008884916 = weight(_text_:information in 2623) [ClassicSimilarity], result of:
      0.008884916 = score(doc=2623,freq=4.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.16457605 = fieldWeight in 2623, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=2623)
    0.026380861 = weight(_text_:retrieval in 2623) [ClassicSimilarity], result of:
      0.026380861 = score(doc=2623,freq=4.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.2835858 = fieldWeight in 2623, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=2623)
    0.06087137 = sum of:
      0.035871506 = weight(_text_:evaluation in 2623) [ClassicSimilarity], result of:
        0.035871506 = score(doc=2623,freq=2.0), product of:
          0.12900078 = queryWeight, product of:
            4.1947007 = idf(docFreq=1811, maxDocs=44218)
            0.030753274 = queryNorm
          0.278072 = fieldWeight in 2623, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.1947007 = idf(docFreq=1811, maxDocs=44218)
            0.046875 = fieldNorm(doc=2623)
      0.024999864 = weight(_text_:22 in 2623) [ClassicSimilarity], result of:
        0.024999864 = score(doc=2623,freq=2.0), product of:
          0.107692726 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.030753274 = queryNorm
          0.23214069 = fieldWeight in 2623, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=2623)
  0.3 = coord(3/10)

Abstract: Contemporary retrieval systems, which search across collections, usually ignore collection-level metadata. Alternative approaches, exploiting collection-level information, will require an understanding of the various kinds of relationships that can obtain between collection-level and item-level metadata. This paper outlines the problem and describes a project that is developing a logic-based framework for classifying collection/item metadata relationships. This framework will support (i) metadata specification developers defining metadata elements, (ii) metadata creators describing objects, and (iii) system designers implementing systems that take advantage of collection-level metadata. We present three examples of collection/item metadata relationship categories, attribute/value-propagation, value-propagation, and value-constraint and show that even in these simple cases a precise formulation requires modal notions in addition to first-order logic. These formulations are related to recent work in information retrieval and ontology evaluation.
Source: Metadata for semantic and social applications : proceedings of the International Conference on Dublin Core and Metadata Applications, Berlin, 22 - 26 September 2008, DC 2008: Berlin, Germany / ed. by Jane Greenberg and Wolfgang Klas

Aitken, S.; Reid, S.: Evaluation of an ontology-based information retrieval tool (2000) 0.03

0.02862323 = product of:
  0.095410764 = sum of:
    0.011846555 = weight(_text_:information in 2862) [ClassicSimilarity], result of:
      0.011846555 = score(doc=2862,freq=4.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.21943474 = fieldWeight in 2862, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0625 = fieldNorm(doc=2862)
    0.04974423 = weight(_text_:retrieval in 2862) [ClassicSimilarity], result of:
      0.04974423 = score(doc=2862,freq=8.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.5347345 = fieldWeight in 2862, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0625 = fieldNorm(doc=2862)
    0.03381998 = product of:
      0.06763996 = sum of:
        0.06763996 = weight(_text_:evaluation in 2862) [ClassicSimilarity], result of:
          0.06763996 = score(doc=2862,freq=4.0), product of:
            0.12900078 = queryWeight, product of:
              4.1947007 = idf(docFreq=1811, maxDocs=44218)
              0.030753274 = queryNorm
            0.5243376 = fieldWeight in 2862, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.1947007 = idf(docFreq=1811, maxDocs=44218)
              0.0625 = fieldNorm(doc=2862)
      0.5 = coord(1/2)
  0.3 = coord(3/10)

Abstract: This paper evaluates the use of an explicit domain ontology in an information retrieval tool. The evaluation compares the performance of ontology-enhanced retrieval with keyword retrieval for a fixed set of queries across several data sets. The robustness of the IR approach is assessed by comparing the performance of the tool on the original data set with that on previously unseen data.

Gödert, W.; Hubrich, J.; Nagelschmidt, M.: Semantic knowledge representation for information retrieval (2014) 0.03

0.025085254 = product of:
  0.08361751 = sum of:
    0.021763513 = weight(_text_:information in 987) [ClassicSimilarity], result of:
      0.021763513 = score(doc=987,freq=24.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.40312737 = fieldWeight in 987, product of:
          4.8989797 = tf(freq=24.0), with freq of:
            24.0 = termFreq=24.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=987)
    0.04935407 = weight(_text_:retrieval in 987) [ClassicSimilarity], result of:
      0.04935407 = score(doc=987,freq=14.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.5305404 = fieldWeight in 987, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=987)
    0.012499932 = product of:
      0.024999864 = sum of:
        0.024999864 = weight(_text_:22 in 987) [ClassicSimilarity], result of:
          0.024999864 = score(doc=987,freq=2.0), product of:
            0.107692726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.030753274 = queryNorm
            0.23214069 = fieldWeight in 987, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=987)
      0.5 = coord(1/2)
  0.3 = coord(3/10)

Abstract: This book covers the basics of semantic web technologies and indexing languages, and describes their contribution to improve languages as a tool for subject queries and knowledge exploration. The book is relevant to information scientists, knowledge workers and indexers. It provides a suitable combination of theoretical foundations and practical applications.
Content: Introduction: envisioning semantic information spacesIndexing and knowledge organization -- Semantic technologies for knowledge representation -- Information retrieval and knowledge exploration -- Approaches to handle heterogeneity -- Problems with establishing semantic interoperability -- Formalization in indexing languages -- Typification of semantic relations -- Inferences in retrieval processes -- Semantic interoperability and inferences -- Remaining research questions.
Date: 23. 7.2017 13:49:22
LCSH: Information retrieval
Knowledge representation (Information theory)
Information organization
RSWK: Information Retrieval
Subject: Information retrieval
Knowledge representation (Information theory)
Information organization
Information Retrieval

Rajasurya, S.; Muralidharan, T.; Devi, S.; Swamynathan, S.: Semantic information retrieval using ontology in university domain (2012) 0.02

0.024227323 = product of:
  0.08075774 = sum of:
    0.0090681305 = weight(_text_:information in 2861) [ClassicSimilarity], result of:
      0.0090681305 = score(doc=2861,freq=6.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.16796975 = fieldWeight in 2861, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2861)
    0.02198405 = weight(_text_:retrieval in 2861) [ClassicSimilarity], result of:
      0.02198405 = score(doc=2861,freq=4.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.23632148 = fieldWeight in 2861, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2861)
    0.049705554 = weight(_text_:ranking in 2861) [ClassicSimilarity], result of:
      0.049705554 = score(doc=2861,freq=2.0), product of:
        0.16634533 = queryWeight, product of:
          5.4090285 = idf(docFreq=537, maxDocs=44218)
          0.030753274 = queryNorm
        0.29880944 = fieldWeight in 2861, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.4090285 = idf(docFreq=537, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2861)
  0.3 = coord(3/10)

Abstract: Today's conventional search engines hardly do provide the essential content relevant to the user's search query. This is because the context and semantics of the request made by the user is not analyzed to the full extent. So here the need for a semantic web search arises. SWS is upcoming in the area of web search which combines Natural Language Processing and Artificial Intelligence. The objective of the work done here is to design, develop and implement a semantic search engine- SIEU(Semantic Information Extraction in University Domain) confined to the university domain. SIEU uses ontology as a knowledge base for the information retrieval process. It is not just a mere keyword search. It is one layer above what Google or any other search engines retrieve by analyzing just the keywords. Here the query is analyzed both syntactically and semantically. The developed system retrieves the web results more relevant to the user query through keyword expansion. The results obtained here will be accurate enough to satisfy the request made by the user. The level of accuracy will be enhanced since the query is analyzed semantically. The system will be of great use to the developers and researchers who work on web. The Google results are re-ranked and optimized for providing the relevant links. For ranking an algorithm has been applied which fetches more apt results for the user query.

Yi, M.: Information organization and retrieval using a topic maps-based ontology : results of a task-based evaluation (2008) 0.02

0.024132898 = product of:
  0.08044299 = sum of:
    0.017769832 = weight(_text_:information in 2369) [ClassicSimilarity], result of:
      0.017769832 = score(doc=2369,freq=16.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.3291521 = fieldWeight in 2369, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=2369)
    0.03730817 = weight(_text_:retrieval in 2369) [ClassicSimilarity], result of:
      0.03730817 = score(doc=2369,freq=8.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.40105087 = fieldWeight in 2369, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=2369)
    0.025364986 = product of:
      0.05072997 = sum of:
        0.05072997 = weight(_text_:evaluation in 2369) [ClassicSimilarity], result of:
          0.05072997 = score(doc=2369,freq=4.0), product of:
            0.12900078 = queryWeight, product of:
              4.1947007 = idf(docFreq=1811, maxDocs=44218)
              0.030753274 = queryNorm
            0.3932532 = fieldWeight in 2369, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.1947007 = idf(docFreq=1811, maxDocs=44218)
              0.046875 = fieldNorm(doc=2369)
      0.5 = coord(1/2)
  0.3 = coord(3/10)

Abstract: As information becomes richer and more complex, alternative information-organization methods are needed to more effectively and efficiently retrieve information from various systems, including the Web. The objective of this study is to explore how a Topic Maps-based ontology approach affects users' searching performance. Forty participants participated in a task-based evaluation where two dependent variables, recall and search time, were measured. The results of this study indicate that a Topic Maps-based ontology information retrieval (TOIR) system has a significant and positive effect on both recall and search time, compared to a thesaurus-based information retrieval (TIR) system. These results suggest that the inclusion of a Topic Maps-based ontology is a beneficial approach to take when designing information retrieval systems.
Source: Journal of the American Society for Information Science and Technology. 59(2008) no.12, S.1898-1911

Teskey, F.N.: Enriched knowledge representation for information retrieval (1987) 0.02

0.023805395 = product of:
  0.07935131 = sum of:
    0.020731471 = weight(_text_:information in 698) [ClassicSimilarity], result of:
      0.020731471 = score(doc=698,freq=16.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.3840108 = fieldWeight in 698, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=698)
    0.037694797 = weight(_text_:retrieval in 698) [ClassicSimilarity], result of:
      0.037694797 = score(doc=698,freq=6.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.40520695 = fieldWeight in 698, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0546875 = fieldNorm(doc=698)
    0.020925045 = product of:
      0.04185009 = sum of:
        0.04185009 = weight(_text_:evaluation in 698) [ClassicSimilarity], result of:
          0.04185009 = score(doc=698,freq=2.0), product of:
            0.12900078 = queryWeight, product of:
              4.1947007 = idf(docFreq=1811, maxDocs=44218)
              0.030753274 = queryNorm
            0.32441732 = fieldWeight in 698, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.1947007 = idf(docFreq=1811, maxDocs=44218)
              0.0546875 = fieldNorm(doc=698)
      0.5 = coord(1/2)
  0.3 = coord(3/10)

Abstract: In this paper we identify the need for a new theory of information. An information model is developed which distinguishes between data, as directly observable facts, information, as structured collections of data, and knowledge as methods of using information. The model is intended to support a wide range of information systems. In the paper we develop the use of the model for a semantic information retrieval system using the concept of semantic categories. The likely benefits of this area discussed, though as yet no detailed evaluation has been conducted.
Source: SIGIR'87: Proceedings of the 10th annual international ACM SIGIR conference on Research and development in information retrieval

Mayfield, J.; Finin, T.: Information retrieval on the Semantic Web : integrating inference and retrieval 0.02

0.02208383 = product of:
  0.073612764 = sum of:
    0.010365736 = weight(_text_:information in 4330) [ClassicSimilarity], result of:
      0.010365736 = score(doc=4330,freq=4.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.1920054 = fieldWeight in 4330, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4330)
    0.048663773 = weight(_text_:retrieval in 4330) [ClassicSimilarity], result of:
      0.048663773 = score(doc=4330,freq=10.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.5231199 = fieldWeight in 4330, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4330)
    0.014583254 = product of:
      0.029166508 = sum of:
        0.029166508 = weight(_text_:22 in 4330) [ClassicSimilarity], result of:
          0.029166508 = score(doc=4330,freq=2.0), product of:
            0.107692726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.030753274 = queryNorm
            0.2708308 = fieldWeight in 4330, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4330)
      0.5 = coord(1/2)
  0.3 = coord(3/10)

Abstract: One vision of the Semantic Web is that it will be much like the Web we know today, except that documents will be enriched by annotations in machine understandable markup. These annotations will provide metadata about the documents as well as machine interpretable statements capturing some of the meaning of document content. We discuss how the information retrieval paradigm might be recast in such an environment. We suggest that retrieval can be tightly bound to inference. Doing so makes today's Web search engines useful to Semantic Web inference engines, and causes improvements in either retrieval or inference to lead directly to improvements in the other.
Date: 12. 2.2011 17:35:22

Atanassova, I.; Bertin, M.: Semantic facets for scientific information retrieval (2014) 0.02

0.021983761 = product of:
  0.0732792 = sum of:
    0.014659365 = weight(_text_:information in 4471) [ClassicSimilarity], result of:
      0.014659365 = score(doc=4471,freq=8.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.27153665 = fieldWeight in 4471, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4471)
    0.037694797 = weight(_text_:retrieval in 4471) [ClassicSimilarity], result of:
      0.037694797 = score(doc=4471,freq=6.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.40520695 = fieldWeight in 4471, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4471)
    0.020925045 = product of:
      0.04185009 = sum of:
        0.04185009 = weight(_text_:evaluation in 4471) [ClassicSimilarity], result of:
          0.04185009 = score(doc=4471,freq=2.0), product of:
            0.12900078 = queryWeight, product of:
              4.1947007 = idf(docFreq=1811, maxDocs=44218)
              0.030753274 = queryNorm
            0.32441732 = fieldWeight in 4471, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.1947007 = idf(docFreq=1811, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4471)
      0.5 = coord(1/2)
  0.3 = coord(3/10)

Abstract: We present an Information Retrieval System for scientific publications that provides the possibility to filter results according to semantic facets. We use sentence-level semantic annotations that identify specific semantic relations in texts, such as methods, definitions, hypotheses, that correspond to common information needs related to scientific literature. The semantic annotations are obtained using a rule-based method that identifies linguistic clues organized into a linguistic ontology. The system is implemented using Solr Search Server and offers efficient search and navigation in scientific papers.
Series: Communications in computer and information science; vol.475
Source: Semantic Web Evaluation Challenge. SemWebEval 2014 at ESWC 2014, Anissaras, Crete, Greece, May 25-29, 2014, Revised Selected Papers. Eds.: V. Presutti et al
Theme: Semantisches Umfeld in Indexierung u. Retrieval

Kara, S.: ¬An ontology-based retrieval system using semantic indexing (2012) 0.02

0.020342728 = product of:
  0.06780909 = sum of:
    0.0125651695 = weight(_text_:information in 3829) [ClassicSimilarity], result of:
      0.0125651695 = score(doc=3829,freq=8.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.23274569 = fieldWeight in 3829, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=3829)
    0.03730817 = weight(_text_:retrieval in 3829) [ClassicSimilarity], result of:
      0.03730817 = score(doc=3829,freq=8.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.40105087 = fieldWeight in 3829, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=3829)
    0.017935753 = product of:
      0.035871506 = sum of:
        0.035871506 = weight(_text_:evaluation in 3829) [ClassicSimilarity], result of:
          0.035871506 = score(doc=3829,freq=2.0), product of:
            0.12900078 = queryWeight, product of:
              4.1947007 = idf(docFreq=1811, maxDocs=44218)
              0.030753274 = queryNorm
            0.278072 = fieldWeight in 3829, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.1947007 = idf(docFreq=1811, maxDocs=44218)
              0.046875 = fieldNorm(doc=3829)
      0.5 = coord(1/2)
  0.3 = coord(3/10)

Abstract: In this thesis, we present an ontology-based information extraction and retrieval system and its application to soccer domain. In general, we deal with three issues in semantic search, namely, usability, scalability and retrieval performance. We propose a keyword-based semantic retrieval approach. The performance of the system is improved considerably using domain-specific information extraction, inference and rules. Scalability is achieved by adapting a semantic indexing approach. The system is implemented using the state-of-the-art technologies in SemanticWeb and its performance is evaluated against traditional systems as well as the query expansion methods. Furthermore, a detailed evaluation is provided to observe the performance gain due to domain-specific information extraction and inference. Finally, we show how we use semantic indexing to solve simple structural ambiguities.
Source: Information Systems. 37(2012) no. 4, S.294-305

Styltsvig, H.B.: Ontology-based information retrieval (2006) 0.02
```
0.020089902 = product of:
  0.06696634 = sum of:
    0.0110814385 = weight(_text_:information in 1154) [ClassicSimilarity], result of:
      0.0110814385 = score(doc=1154,freq=14.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.20526241 = fieldWeight in 1154, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03125 = fieldNorm(doc=1154)
    0.03517448 = weight(_text_:retrieval in 1154) [ClassicSimilarity], result of:
      0.03517448 = score(doc=1154,freq=16.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.37811437 = fieldWeight in 1154, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.03125 = fieldNorm(doc=1154)
    0.020710424 = product of:
      0.041420847 = sum of:
        0.041420847 = weight(_text_:evaluation in 1154) [ClassicSimilarity], result of:
          0.041420847 = score(doc=1154,freq=6.0), product of:
            0.12900078 = queryWeight, product of:
              4.1947007 = idf(docFreq=1811, maxDocs=44218)
              0.030753274 = queryNorm
            0.3210899 = fieldWeight in 1154, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              4.1947007 = idf(docFreq=1811, maxDocs=44218)
              0.03125 = fieldNorm(doc=1154)
      0.5 = coord(1/2)
  0.3 = coord(3/10)
```
Abstract

In this thesis, we will present methods for introducing ontologies in information retrieval. The main hypothesis is that the inclusion of conceptual knowledge such as ontologies in the information retrieval process can contribute to the solution of major problems currently found in information retrieval. This utilization of ontologies has a number of challenges. Our focus is on the use of similarity measures derived from the knowledge about relations between concepts in ontologies, the recognition of semantic information in texts and the mapping of this knowledge into the ontologies in use, as well as how to fuse together the ideas of ontological similarity and ontological indexing into a realistic information retrieval scenario. To achieve the recognition of semantic knowledge in a text, shallow natural language processing is used during indexing that reveals knowledge to the level of noun phrases. Furthermore, we briefly cover the identification of semantic relations inside and between noun phrases, as well as discuss which kind of problems are caused by an increase in compoundness with respect to the structure of concepts in the evaluation of queries. Measuring similarity between concepts based on distances in the structure of the ontology is discussed. In addition, a shared nodes measure is introduced and, based on a set of intuitive similarity properties, compared to a number of different measures. In this comparison the shared nodes measure appears to be superior, though more computationally complex. Some of the major problems of shared nodes which relate to the way relations differ with respect to the degree they bring the concepts they connect closer are discussed. A generalized measure called weighted shared nodes is introduced to deal with these problems. Finally, the utilization of concept similarity in query evaluation is discussed. A semantic expansion approach that incorporates concept similarity is introduced and a generalized fuzzy set retrieval model that applies expansion during query evaluation is presented. While not commonly used in present information retrieval systems, it appears that the fuzzy set model comprises the flexibility needed when generalizing to an ontology-based retrieval model and, with the introduction of a hierarchical fuzzy aggregation principle, compound concepts can be handled in a straightforward and natural manner.
Koopman, B.; Zuccon, G.; Bruza, P.; Sitbon, L.; Lawley, M.: Information retrieval as semantic inference : a graph Inference model applied to medical search (2016) 0.02
```
0.019383911 = product of:
  0.06461304 = sum of:
    0.00837678 = weight(_text_:information in 3260) [ClassicSimilarity], result of:
      0.00837678 = score(doc=3260,freq=8.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.1551638 = fieldWeight in 3260, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03125 = fieldNorm(doc=3260)
    0.039326265 = weight(_text_:retrieval in 3260) [ClassicSimilarity], result of:
      0.039326265 = score(doc=3260,freq=20.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.42274472 = fieldWeight in 3260, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.03125 = fieldNorm(doc=3260)
    0.01690999 = product of:
      0.03381998 = sum of:
        0.03381998 = weight(_text_:evaluation in 3260) [ClassicSimilarity], result of:
          0.03381998 = score(doc=3260,freq=4.0), product of:
            0.12900078 = queryWeight, product of:
              4.1947007 = idf(docFreq=1811, maxDocs=44218)
              0.030753274 = queryNorm
            0.2621688 = fieldWeight in 3260, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.1947007 = idf(docFreq=1811, maxDocs=44218)
              0.03125 = fieldNorm(doc=3260)
      0.5 = coord(1/2)
  0.3 = coord(3/10)
```
Abstract

This paper presents a Graph Inference retrieval model that integrates structured knowledge resources, statistical information retrieval methods and inference in a unified framework. Key components of the model are a graph-based representation of the corpus and retrieval driven by an inference mechanism achieved as a traversal over the graph. The model is proposed to tackle the semantic gap problem-the mismatch between the raw data and the way a human being interprets it. We break down the semantic gap problem into five core issues, each requiring a specific type of inference in order to be overcome. Our model and evaluation is applied to the medical domain because search within this domain is particularly challenging and, as we show, often requires inference. In addition, this domain features both structured knowledge resources as well as unstructured text. Our evaluation shows that inference can be effective, retrieving many new relevant documents that are not retrieved by state-of-the-art information retrieval models. We show that many retrieved documents were not pooled by keyword-based search methods, prompting us to perform additional relevance assessment on these new documents. A third of the newly retrieved documents judged were found to be relevant. Our analysis provides a thorough understanding of when and how to apply inference for retrieval, including a categorisation of queries according to the effect of inference. The inference mechanism promoted recall by retrieving new relevant documents not found by previous keyword-based approaches. In addition, it promoted precision by an effective reranking of documents. When inference is used, performance gains can generally be expected on hard queries. However, inference should not be applied universally: for easy, unambiguous queries and queries with few relevant documents, inference did adversely affect effectiveness. These conclusions reflect the fact that for retrieval as inference to be effective, a careful balancing act is involved. Finally, although the Graph Inference model is developed and applied to medical search, it is a general retrieval model applicable to other areas such as web search, where an emerging research trend is to utilise structured knowledge resources for more effective semantic search.

Source

Information Retrieval Journal. 19(2016) no.1, S.6-37

Theme

Semantisches Umfeld in Indexierung u. Retrieval
Alaya, N.; Yahia, S.B.; Lamolle, M.: Ranking with ties of OWL ontology reasoners based on learned performances (2016) 0.02
```
0.01869933 = product of:
  0.09349664 = sum of:
    0.007404097 = weight(_text_:information in 3378) [ClassicSimilarity], result of:
      0.007404097 = score(doc=3378,freq=4.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.13714671 = fieldWeight in 3378, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3378)
    0.08609255 = weight(_text_:ranking in 3378) [ClassicSimilarity], result of:
      0.08609255 = score(doc=3378,freq=6.0), product of:
        0.16634533 = queryWeight, product of:
          5.4090285 = idf(docFreq=537, maxDocs=44218)
          0.030753274 = queryNorm
        0.51755315 = fieldWeight in 3378, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          5.4090285 = idf(docFreq=537, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3378)
  0.2 = coord(2/10)
```
Abstract

Over the last decade, several ontology reasoners have been proposed to overcome the computational complexity of inference tasks on expressive ontology languages such as OWL 2 DL. Nevertheless, it is well-accepted that there is no outstanding reasoner that can outperform in all input ontologies. Thus, deciding the most suitable reasoner for an ontology based application is still a time and effort consuming task. In this paper, we suggest to develop a new system to provide user support when looking for guidance over ontology reasoners. At first, we will be looking at automatically predict a single reasoner empirical performances, in particular its robustness and efficiency, over any given ontology. Later, we aim at ranking a set of candidate reasoners in a most preferred order by taking into account information regarding their predicted performances. We conducted extensive experiments covering over 2500 well selected real-world ontologies and six state-of-the-art of the most performing reasoners. Our primary prediction and ranking results are encouraging and witnessing the potential benefits of our approach.

Series

Communications in computer and information science; 631

Beppler, F.D.; Fonseca, F.T.; Pacheco, R.C.S.: Hermeneus: an architecture for an ontology-enabled information retrieval (2008) 0.02

0.017657416 = product of:
  0.05885805 = sum of:
    0.014048288 = weight(_text_:information in 3261) [ClassicSimilarity], result of:
      0.014048288 = score(doc=3261,freq=10.0), product of:
        0.05398669 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.030753274 = queryNorm
        0.2602176 = fieldWeight in 3261, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=3261)
    0.032309826 = weight(_text_:retrieval in 3261) [ClassicSimilarity], result of:
      0.032309826 = score(doc=3261,freq=6.0), product of:
        0.093026035 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.030753274 = queryNorm
        0.34732026 = fieldWeight in 3261, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=3261)
    0.012499932 = product of:
      0.024999864 = sum of:
        0.024999864 = weight(_text_:22 in 3261) [ClassicSimilarity], result of:
          0.024999864 = score(doc=3261,freq=2.0), product of:
            0.107692726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.030753274 = queryNorm
            0.23214069 = fieldWeight in 3261, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=3261)
      0.5 = coord(1/2)
  0.3 = coord(3/10)

Abstract: Ontologies improve IR systems regarding its retrieval and presentation of information, which make the task of finding information more effective, efficient, and interactive. In this paper we argue that ontologies also greatly improve the engineering of such systems. We created a framework that uses ontology to drive the process of engineering an IR system. We developed a prototype that shows how a domain specialist without knowledge in the IR field can build an IR system with interactive components. The resulting system provides support for users not only to find their information needs but also to extend their state of knowledge. This way, our approach to ontology-enabled information retrieval addresses both the engineering aspect described here and also the usability aspect described elsewhere.
Date: 28.11.2016 12:43:22

Search (415 results, page 1 of 21)

Authors

Years

Languages

Types

Themes

Subjects

Classifications