Search (6 results, page 1 of 1)

Hemmje, M.: LyberWorld : eine 3D-basierte Benutzerschnittstelle für die computerunterstützte Informationssuche in Dokumentmengen (1993) 0.01

0.0060557844 = product of:
  0.012111569 = sum of:
    0.012111569 = product of:
      0.024223138 = sum of:
        0.024223138 = weight(_text_:d in 1534) [ClassicSimilarity], result of:
          0.024223138 = score(doc=1534,freq=2.0), product of:
            0.08242767 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.04338591 = queryNorm
            0.29387143 = fieldWeight in 1534, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.109375 = fieldNorm(doc=1534)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Language: d
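
The score explanation for document 1534 above can be reproduced by hand. The following is a minimal sketch, not the search engine's own code: it assumes the classic TF-IDF formulas that match the numbers shown (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))) and takes queryNorm, fieldNorm and the two coord factors as given inputs copied from the listing. The function names are hypothetical.

```python
import math


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency in the form that reproduces the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_score(freq, doc_freq, max_docs, query_norm, field_norm, coords=()):
    """Rebuild one branch of the explanation tree for a single term."""
    tf = math.sqrt(freq)                      # tf(freq=2.0) -> 1.4142135
    idf = classic_idf(doc_freq, max_docs)     # idf(docFreq, maxDocs)
    query_weight = idf * query_norm           # queryWeight
    field_weight = tf * idf * field_norm      # fieldWeight
    score = query_weight * field_weight       # weight(_text_:d in doc)
    for coord in coords:                      # each coord(1/2) halves the score
        score *= coord
    return score


# Values copied from the explanation for doc=1534.
idf_d = classic_idf(doc_freq=17979, max_docs=44218)
score_1534 = classic_score(
    freq=2.0,
    doc_freq=17979,
    max_docs=44218,
    query_norm=0.04338591,
    field_norm=0.109375,
    coords=(0.5, 0.5),
)
print(f"idf={idf_d:.6f} score={score_1534:.10f}")
```

Because idf and queryNorm are identical for every hit, the ranking in this result list depends only on tf and fieldNorm: a shorter field with fewer occurrences can outrank a longer field with more.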

Frederichs, A.: Natürlichsprachige Abfrage und 3-D-Visualisierung von Wissenszusammenhängen (2007) 0.00

0.0030586333 = product of:
  0.0061172666 = sum of:
    0.0061172666 = product of:
      0.012234533 = sum of:
        0.012234533 = weight(_text_:d in 566) [ClassicSimilarity], result of:
          0.012234533 = score(doc=566,freq=4.0), product of:
            0.08242767 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.04338591 = queryNorm
            0.1484275 = fieldWeight in 566, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0390625 = fieldNorm(doc=566)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Language: d

Schek, M.: Automatische Klassifizierung in Erschließung und Recherche eines Pressearchivs (2006) 0.00

0.0024469066 = product of:
  0.0048938133 = sum of:
    0.0048938133 = product of:
      0.009787627 = sum of:
        0.009787627 = weight(_text_:d in 6043) [ClassicSimilarity], result of:
          0.009787627 = score(doc=6043,freq=4.0), product of:
            0.08242767 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.04338591 = queryNorm
            0.118742 = fieldWeight in 6043, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.03125 = fieldNorm(doc=6043)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Language: d
Location: D

Zhang, J.; Mostafa, J.; Tripathy, H.: Information retrieval by semantic analysis and visualization of the concept space of D-Lib® magazine (2002) 0.00
```
0.002418062 = product of:
  0.004836124 = sum of:
    0.004836124 = product of:
      0.009672248 = sum of:
        0.009672248 = weight(_text_:d in 1211) [ClassicSimilarity], result of:
          0.009672248 = score(doc=1211,freq=10.0), product of:
            0.08242767 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.04338591 = queryNorm
            0.11734224 = fieldWeight in 1211, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.01953125 = fieldNorm(doc=1211)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

In this article we present a method for retrieving documents from a digital library through a visual interface based on automatically generated concepts. We used a vocabulary generation algorithm to generate a set of concepts for the digital library and the max-min distance technique to cluster them. Additionally, the concepts were visualized in a spring embedding graph layout to depict the semantic relationships among them. The resulting graph layout serves as an aid to users for retrieving documents. An online archive containing the contents of D-Lib Magazine from July 1995 to May 2002 was used to test the utility of an implemented retrieval and visualization system. We believe that the method developed and tested can be applied to many different domains to help users get a better understanding of online document collections and to minimize users' cognitive load during execution of search tasks.

Over the past few years, the volume of information available through the World Wide Web has been expanding exponentially. Never has so much information been so readily available and shared among so many people. Unfortunately, the unstructured nature and huge volume of information accessible over networks have made it hard for users to sift through and find relevant information. To deal with this problem, information retrieval (IR) techniques have gained more intensive attention from both industrial and academic researchers. Numerous IR techniques have been developed to help deal with the information overload problem. These techniques concentrate on mathematical models and algorithms for retrieval. Popular IR models such as the Boolean model, the vector-space model, the probabilistic model and their variants are well established.

Nevertheless, because thesaurus use has been shown to improve retrieval, our method integrates functions in the search interface that permit users to explore built-in search vocabularies to improve retrieval from digital libraries. Our method automatically generates the terms and their semantic relationships representing relevant topics covered in a digital library. We call these generated terms the "concepts", and the generated terms together with their semantic relationships the "concept space". Additionally, we used a visualization technique to display the concept space and allow users to interact with this space. The automatically generated term set is considered to be more representative of the subject area of a corpus than an "externally" imposed thesaurus, and our method also has the potential to save a significant amount of time and labor for those who have been creating thesauri manually.

Information visualization is an emerging discipline that has developed very quickly in the last decade. With growing volumes of documents and associated complexities, information visualization has become increasingly important. Researchers have found information visualization to be an effective way to use and understand information while minimizing a user's cognitive load.

Our work was based on an algorithmic approach to concept discovery and association. Concepts are discovered using an algorithm based on an automated thesaurus generation procedure. Subsequently, similarities among terms are computed using the cosine measure, and the associations among terms are established using a method known as max-min distance clustering. The concept space is then visualized in a spring embedding graph, which roughly shows the semantic relationships among concepts in a 2-D visual representation. The semantic space of the visualization is used as a medium for users to retrieve the desired documents.

In the remainder of this article, we present our algorithmic approach to concept generation and clustering, followed by a description of the visualization technique and the interactive interface. The paper ends with key conclusions and a discussion of future work.

Content

A prototype of this interface has been developed as a Java applet and is available at <http://ella.slis.indiana.edu/~junzhang/dlib/IV.html>. The D-Lib search interface is available at <http://www.dlib.org/Architext/AT-dlib2query.html>.

Source

D-Lib magazine. 8(2002) no.10, x S
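
The abstract of Zhang, Mostafa and Tripathy above names two steps, cosine similarity between terms and max-min distance clustering, without giving details. The following is a rough sketch of how such a pipeline could look, not the authors' implementation. It assumes the textbook form of max-min distance clustering (repeatedly promote the point farthest from all existing centers until that distance falls below a fraction `theta` of the mean distance between centers). The term vectors, the `theta` value and all function names are invented for illustration.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two term vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def cosine_distance(u, v):
    return 1.0 - cosine_similarity(u, v)


def max_min_clusters(vectors, theta=0.5):
    """Return (center indices, per-vector index of its nearest center)."""
    centers = [0]  # assumption: the first vector seeds the first cluster
    while len(centers) < len(vectors):
        # Distance from every non-center vector to its nearest center.
        nearest = {
            i: min(cosine_distance(vectors[i], vectors[c]) for c in centers)
            for i in range(len(vectors))
            if i not in centers
        }
        farthest = max(nearest, key=nearest.get)
        if len(centers) == 1:
            accept = nearest[farthest] > 0.0
        else:
            pairs = [(a, b) for k, a in enumerate(centers) for b in centers[k + 1:]]
            mean_gap = sum(
                cosine_distance(vectors[a], vectors[b]) for a, b in pairs
            ) / len(pairs)
            accept = nearest[farthest] > theta * mean_gap
        if not accept:
            break
        centers.append(farthest)
    labels = [
        min(centers, key=lambda c: cosine_distance(v, vectors[c])) for v in vectors
    ]
    return centers, labels


# Hypothetical term-by-document counts, one row per term.
terms = ["library", "archive", "graph", "layout"]
vectors = [[3, 0, 1], [2, 0, 1], [0, 4, 1], [0, 3, 1]]
centers, labels = max_min_clusters(vectors)
for term, label in zip(terms, labels):
    print(term, "->", terms[label])
```

With these toy vectors, "library" and "archive" share one cluster while "graph" and "layout" share another. A spring embedding layout, as mentioned in the abstract, would then place strongly associated terms close together.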

Rudolph, S.; Hemmje, M.: Visualisierung von Thesauri zur interaktiven Unterstützung von visuellen Anfragen an Textdatenbanken (1994) 0.00

0.0021627804 = product of:
  0.0043255608 = sum of:
    0.0043255608 = product of:
      0.0086511215 = sum of:
        0.0086511215 = weight(_text_:d in 2382) [ClassicSimilarity], result of:
          0.0086511215 = score(doc=2382,freq=2.0), product of:
            0.08242767 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.04338591 = queryNorm
            0.104954086 = fieldWeight in 2382, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2382)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Language: d

Cao, N.; Sun, J.; Lin, Y.-R.; Gotz, D.; Liu, S.; Qu, H.: FacetAtlas : Multifaceted visualization for rich text corpora (2010) 0.00

0.0021627804 = product of:
  0.0043255608 = sum of:
    0.0043255608 = product of:
      0.0086511215 = sum of:
        0.0086511215 = weight(_text_:d in 3366) [ClassicSimilarity], result of:
          0.0086511215 = score(doc=3366,freq=2.0), product of:
            0.08242767 = queryWeight, product of:
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.04338591 = queryNorm
            0.104954086 = fieldWeight in 3366, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.899872 = idf(docFreq=17979, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3366)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
