Search (4 results, page 1 of 1)

Survey of text mining : clustering, classification, and retrieval (2004) 0.00
```
0.00439501 = product of:
  0.02637006 = sum of:
    0.02637006 = product of:
      0.05274012 = sum of:
        0.05274012 = weight(_text_:theory in 804) [ClassicSimilarity], result of:
          0.05274012 = score(doc=804,freq=4.0), product of:
            0.16234003 = queryWeight, product of:
              4.1583924 = idf(docFreq=1878, maxDocs=44218)
              0.03903913 = queryNorm
            0.3248744 = fieldWeight in 804, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.1583924 = idf(docFreq=1878, maxDocs=44218)
              0.0390625 = fieldNorm(doc=804)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)
```
Abstract

Extracting content from text continues to be an important research problem for information processing and management. Approaches to capture the semantics of text-based document collections may be based on Bayesian models, probability theory, vector space models, statistical models, or even graph theory. As the volume of digitized textual media continues to grow, so does the need for designing robust, scalable indexing and search strategies (software) to meet a variety of user needs. Knowledge extraction or creation from text requires systematic yet reliable processing that can be codified and adapted for changing needs and environments. This book will draw upon experts in both academia and industry to recommend practical approaches to the purification, indexing, and mining of textual information. It will address document identification, clustering and categorizing documents, cleaning text, and visualizing semantic models of text.
Blair, D.C.: Language and representation in information retrieval (1991) 0.00
```
0.0035160081 = product of:
  0.021096049 = sum of:
    0.021096049 = product of:
      0.042192098 = sum of:
        0.042192098 = weight(_text_:theory in 1545) [ClassicSimilarity], result of:
          0.042192098 = score(doc=1545,freq=4.0), product of:
            0.16234003 = queryWeight, product of:
              4.1583924 = idf(docFreq=1878, maxDocs=44218)
              0.03903913 = queryNorm
            0.25989953 = fieldWeight in 1545, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.1583924 = idf(docFreq=1878, maxDocs=44218)
              0.03125 = fieldNorm(doc=1545)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)
```
Abstract

Information or Document Retrieval is the subject of this book. It is not an introductory book, although it is self-contained in the sense that it is not necessary to have a background in the theory or practice of Information Retrieval in order to understand its arguments. The book presents, as clearly as possible, one particular perspective on Information Retrieval, and attempts to say that certain aspects of the theory or practice of the management of documents are more important than others. The majority of Information Retrieval research has been aimed at the more experimentally tractable small-scale systems, and although much of that work has added greatly to our understanding of Information Retrieval it is becoming increasingly apparent that retrieval systems with large data bases of documents are a fundamentally different genre of systems than small-scale systems. If this is so, which is the thesis of this book, then we must now study large information retrieval systems with the same rigor and intensity that we once studied small-scale systems. Hegel observed that the quantitative growth of any system caused qualitative changes to take place in its structure and processes.

Schaarschmidt, R.: Archivierung in Datenbanksystemen : Konzept und Sprache (2001) 0.00

0.0026686378 = product of:
  0.016011827 = sum of:
    0.016011827 = product of:
      0.032023653 = sum of:
        0.032023653 = weight(_text_:29 in 3912) [ClassicSimilarity], result of:
          0.032023653 = score(doc=3912,freq=2.0), product of:
            0.13732746 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.03903913 = queryNorm
            0.23319192 = fieldWeight in 3912, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.046875 = fieldNorm(doc=3912)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)

Date: 21. 3.2008 18:41:29

Grossman, D.A.; Frieder, O.: Information retrieval : algorithms and heuristics (2004) 0.00
```
0.0023239423 = product of:
  0.013943653 = sum of:
    0.013943653 = product of:
      0.027887305 = sum of:
        0.027887305 = weight(_text_:methods in 1486) [ClassicSimilarity], result of:
          0.027887305 = score(doc=1486,freq=2.0), product of:
            0.15695344 = queryWeight, product of:
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.03903913 = queryNorm
            0.17767884 = fieldWeight in 1486, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.03125 = fieldNorm(doc=1486)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)
```
Abstract

Interested in how an efficient search engine works? Want to know what algorithms are used to rank resulting documents in response to user requests? The authors answer these and other key information on retrieval design and implementation questions is provided. This book is not yet another high level text. Instead, algorithms are thoroughly described, making this book ideally suited for both computer science students and practitioners who work on search-related applications. As stated in the foreword, this book provides a current, broad, and detailed overview of the field and is the only one that does so. Examples are used throughout to illustrate the algorithms. The authors explain how a query is ranked against a document collection using either a single or a combination of retrieval strategies, and how an assortment of utilities are integrated into the query processing scheme to improve these rankings. Methods for building and compressing text indexes, querying and retrieving documents in multiple languages, and using parallel or distributed processing to expedite the search are likewise described. This edition is a major expansion of the one published in 1998. Neuaufl. 2005: Besides updating the entire book with current techniques, it includes new sections on language models, cross-language information retrieval, peer-to-peer processing, XML search, mediators, and duplicate document detection.

Search (4 results, page 1 of 1)

Authors

Years

Languages

Types

Themes

Subjects

Classifications