Search (5 results, page 1 of 1)

Information retrieval : data structures and algorithms (1992) 0.02
```
0.019471738 = sum of:
  0.01522842 = product of:
    0.06091368 = sum of:
      0.06091368 = weight(_text_:authors in 3495) [ClassicSimilarity], result of:
        0.06091368 = score(doc=3495,freq=2.0), product of:
          0.2418733 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.053056188 = queryNorm
          0.25184128 = fieldWeight in 3495, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.0390625 = fieldNorm(doc=3495)
    0.25 = coord(1/4)
  0.0042433185 = product of:
    0.008486637 = sum of:
      0.008486637 = weight(_text_:s in 3495) [ClassicSimilarity], result of:
        0.008486637 = score(doc=3495,freq=12.0), product of:
          0.057684682 = queryWeight, product of:
            1.0872376 = idf(docFreq=40523, maxDocs=44218)
            0.053056188 = queryNorm
          0.14712115 = fieldWeight in 3495, product of:
            3.4641016 = tf(freq=12.0), with freq of:
              12.0 = termFreq=12.0
            1.0872376 = idf(docFreq=40523, maxDocs=44218)
            0.0390625 = fieldNorm(doc=3495)
    0.5 = coord(1/2)
```
Abstract

The book consists of separate chapters by some 20 different authors. It covers many of the information retrieval algorithms, including methods of file organization, file search and access, and query processing

Content

An edited volume containing data structures and algorithms for information retrieval including a disk with examples written in C. for prgrammers and students interested in parsing text, automated indexing, its the first collection in book form of the basic data structures and algorithms that are critical to the storage and retrieval of documents. ------------------Enthält die Kapitel: FRAKES, W.B.: Introduction to information storage and retrieval systems; BAEZA-YATES, R.S.: Introduction to data structures and algorithms related to information retrieval; HARMAN, D. u.a.: Inverted files; FALOUTSOS, C.: Signature files; GONNET, G.H. u.a.: New indices for text: PAT trees and PAT arrays; FORD, D.A. u. S. CHRISTODOULAKIS: File organizations for optical disks; FOX, C.: Lexical analysis and stoplists; FRAKES, W.B.: Stemming algorithms; SRINIVASAN, P.: Thesaurus construction; BAEZA-YATES, R.A.: String searching algorithms; HARMAN, D.: Relevance feedback and other query modification techniques; WARTIK, S.: Boolean operators; WARTIK, S. u.a.: Hashing algorithms; HARMAN, D.: Ranking algorithms; FOX, E.: u.a.: Extended Boolean models; RASMUSSEN, E.: Clustering algorithms; HOLLAAR, L.: Special-purpose hardware for information retrieval; STANFILL, C.: Parallel information retrieval algorithms

Footnote

Rez. in: Computing reviews. July 1993, S.341-342 (G. Salton)

Pages

504 S

Type

s
Cross-language information retrieval (1998) 0.01
```
0.00911445 = sum of:
  0.00761421 = product of:
    0.03045684 = sum of:
      0.03045684 = weight(_text_:authors in 6299) [ClassicSimilarity], result of:
        0.03045684 = score(doc=6299,freq=2.0), product of:
          0.2418733 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.053056188 = queryNorm
          0.12592064 = fieldWeight in 6299, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.01953125 = fieldNorm(doc=6299)
    0.25 = coord(1/4)
  0.0015002397 = product of:
    0.0030004794 = sum of:
      0.0030004794 = weight(_text_:s in 6299) [ClassicSimilarity], result of:
        0.0030004794 = score(doc=6299,freq=6.0), product of:
          0.057684682 = queryWeight, product of:
            1.0872376 = idf(docFreq=40523, maxDocs=44218)
            0.053056188 = queryNorm
          0.052015185 = fieldWeight in 6299, product of:
            2.4494898 = tf(freq=6.0), with freq of:
              6.0 = termFreq=6.0
            1.0872376 = idf(docFreq=40523, maxDocs=44218)
            0.01953125 = fieldNorm(doc=6299)
    0.5 = coord(1/2)
```
Footnote

Rez. in: Machine translation review: 1999, no.10, S.26-27 (D. Lewis): "Cross Language Information Retrieval (CLIR) addresses the growing need to access large volumes of data across language boundaries. The typical requirement is for the user to input a free form query, usually a brief description of a topic, into a search or retrieval engine which returns a list, in ranked order, of documents or web pages that are relevant to the topic. The search engine matches the terms in the query to indexed terms, usually keywords previously derived from the target documents. Unlike monolingual information retrieval, CLIR requires query terms in one language to be matched to indexed terms in another. Matching can be done by bilingual dictionary lookup, full machine translation, or by applying statistical methods. A query's success is measured in terms of recall (how many potentially relevant target documents are found) and precision (what proportion of documents found are relevant). Issues in CLIR are how to translate query terms into index terms, how to eliminate alternative translations (e.g. to decide that French 'traitement' in a query means 'treatment' and not 'salary'), and how to rank or weight translation alternatives that are retained (e.g. how to order the French terms 'aventure', 'business', 'affaire', and 'liaison' as relevant translations of English 'affair'). Grefenstette provides a lucid and useful overview of the field and the problems. The volume brings together a number of experiments and projects in CLIR. Mark Davies (New Mexico State University) describes Recuerdo, a Spanish retrieval engine which reduces translation ambiguities by scanning indexes for parallel texts; it also uses either a bilingual dictionary or direct equivalents from a parallel corpus in order to compare results for queries on parallel texts. Lisa Ballesteros and Bruce Croft (University of Massachusetts) use a 'local feedback' technique which automatically enhances a query by adding extra terms to it both before and after translation; such terms can be derived from documents known to be relevant to the query.
The retrieved output from a query including the phrase 'big rockets' may be, for instance, a sentence containing 'giant rocket' which is semantically ranked above 'military ocket'. David Hull (Xerox Research Centre, Grenoble) describes an implementation of a weighted Boolean model for Spanish-English CLIR. Users construct Boolean-type queries, weighting each term in the query, which is then translated by an on-line dictionary before being applied to the database. Comparisons with the performance of unweighted free-form queries ('vector space' models) proved encouraging. Two contributions consider the evaluation of CLIR systems. In order to by-pass the time-consuming and expensive process of assembling a standard collection of documents and of user queries against which the performance of an CLIR system is manually assessed, Páriac Sheridan et al (ETH Zurich) propose a method based on retrieving 'seed documents'. This involves identifying a unique document in a database (the 'seed document') and, for a number of queries, measuring how fast it is retrieved. The authors have also assembled a large database of multilingual news documents for testing purposes. By storing the (fairly short) documents in a structured form tagged with descriptor codes (e.g. for topic, country and area), the test suite is easily expanded while remaining consistent for the purposes of testing. Douglas Ouard and Bonne Dorr (University of Maryland) describe an evaluation methodology which appears to apply LSI techniques in order to filter and rank incoming documents designed for testing CLIR systems. The volume provides the reader an excellent overview of several projects in CLIR. It is well supported with references and is intended as a secondary text for researchers and practitioners. It highlights the need for a good, general tutorial introduction to the field."

Pages

VII,182 S

Type

s

Brenner, E.H.: Beyond Boolean : new approaches in information retrieval; the quest for intuitive online search systems past, present & future (1995) 0.00

0.0017149168 = product of:
  0.0034298336 = sum of:
    0.0034298336 = product of:
      0.006859667 = sum of:
        0.006859667 = weight(_text_:s in 2547) [ClassicSimilarity], result of:
          0.006859667 = score(doc=2547,freq=4.0), product of:
            0.057684682 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.053056188 = queryNorm
            0.118916616 = fieldWeight in 2547, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2547)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Pages: XV,143 S
Type: s

Computational information retrieval (2001) 0.00

0.0014699287 = product of:
  0.0029398573 = sum of:
    0.0029398573 = product of:
      0.0058797146 = sum of:
        0.0058797146 = weight(_text_:s in 4167) [ClassicSimilarity], result of:
          0.0058797146 = score(doc=4167,freq=4.0), product of:
            0.057684682 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.053056188 = queryNorm
            0.101928525 = fieldWeight in 4167, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.046875 = fieldNorm(doc=4167)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Pages: XII,185 S
Type: s

Effektive Information Retrieval Verfahren in Theorie und Praxis : ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005 (2006) 0.00
```
6.000959E-4 = product of:
  0.0012001918 = sum of:
    0.0012001918 = product of:
      0.0024003836 = sum of:
        0.0024003836 = weight(_text_:s in 5973) [ClassicSimilarity], result of:
          0.0024003836 = score(doc=5973,freq=6.0), product of:
            0.057684682 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.053056188 = queryNorm
            0.04161215 = fieldWeight in 5973, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.015625 = fieldNorm(doc=5973)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Footnote

Rez. in: Information - Wissenschaft und Praxis 57(2006) H.5, S.290-291 (C. Schindler): "Weniger als ein Jahr nach dem "Vierten Hildesheimer Evaluierungs- und Retrievalworkshop" (HIER 2005) im Juli 2005 ist der dazugehörige Tagungsband erschienen. Eingeladen hatte die Hildesheimer Informationswissenschaft um ihre Forschungsergebnisse und die einiger externer Experten zum Thema Information Retrieval einem Fachpublikum zu präsentieren und zur Diskussion zu stellen. Unter dem Titel "Effektive Information Retrieval Verfahren in Theorie und Praxis" sind nahezu sämtliche Beiträge des Workshops in dem nun erschienenen, 15 Beiträge umfassenden Band gesammelt. Mit dem Schwerpunkt Information Retrieval (IR) wird ein Teilgebiet der Informationswissenschaft vorgestellt, das schon immer im Zentrum informationswissenschaftlicher Forschung steht. Ob durch den Leistungsanstieg von Prozessoren und Speichermedien, durch die Verbreitung des Internet über nationale Grenzen hinweg oder durch den stetigen Anstieg der Wissensproduktion, festzuhalten ist, dass in einer zunehmend wechselseitig vernetzten Welt die Orientierung und das Auffinden von Dokumenten in großen Wissensbeständen zu einer zentralen Herausforderung geworden sind. Aktuelle Verfahrensweisen zu diesem Thema, dem Information Retrieval, präsentiert der neue Band anhand von praxisbezogenen Projekten und theoretischen Diskussionen. Das Kernthema Information Retrieval wird in dem Sammelband in die Bereiche Retrieval-Systeme, Digitale Bibliothek, Evaluierung und Multilinguale Systeme untergliedert. Die Artikel der einzelnen Sektionen sind insgesamt recht heterogen und bieten daher keine Überschneidungen inhaltlicher Art. Jedoch ist eine vollkommene thematische Abdeckung der unterschiedlichen Bereiche ebenfalls nicht gegeben, was bei der Präsentation von Forschungsergebnissen eines Institutes und seiner Kooperationspartner auch nur bedingt erwartet werden kann. So lässt sich sowohl in der Gliederung als auch in den einzelnen Beiträgen eine thematische Verdichtung erkennen, die das spezielle Profil und die Besonderheit der Hildesheimer Informationswissenschaft im Feld des Information Retrieval wiedergibt. Teil davon ist die mehrsprachige und interdisziplinäre Ausrichtung, die die Schnittstellen zwischen Informationswissenschaft, Sprachwissenschaft und Informatik in ihrer praxisbezogenen und internationalen Forschung fokussiert.

Pages

VIII, 244 S

Type

s

Search (5 results, page 1 of 1)

Years

Languages

Themes