Search (3 results, page 1 of 1)

O'Kane, K.C.: Generating hierarchical document indices from common denominators in large document collections (1996) 0.03
```
0.026352067 = product of:
  0.13176033 = sum of:
    0.13176033 = weight(_text_:index in 4037) [ClassicSimilarity], result of:
      0.13176033 = score(doc=4037,freq=6.0), product of:
        0.2250935 = queryWeight, product of:
          4.369764 = idf(docFreq=1520, maxDocs=44218)
          0.051511593 = queryNorm
        0.5853582 = fieldWeight in 4037, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          4.369764 = idf(docFreq=1520, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4037)
  0.2 = coord(1/5)
```
Abstract

Describes an effective, simple and efficient algorithm for computer generation of hierarchical indices from Document Term matrices by means of calculating common denominator vectors from the document vector set. This procedure produces an intuitive, user friendly hierarchical index of a document collection not unlike that which would be expected had a manual indexer set about to create an index or outline of a collection. The resulting index, when presented with a graphical user interface, provides the user with a natural easily comprehended view of the document collection, permits general browsing and informal search activities with an access method that requires no keyboard entry or prior knowledge of the vocabulary
O'Kane, K.C.; Lockner, M.J.: Indexing genomic sequence libraries (2005) 0.02
```
0.015214371 = product of:
  0.07607185 = sum of:
    0.07607185 = weight(_text_:index in 1009) [ClassicSimilarity], result of:
      0.07607185 = score(doc=1009,freq=2.0), product of:
        0.2250935 = queryWeight, product of:
          4.369764 = idf(docFreq=1520, maxDocs=44218)
          0.051511593 = queryNorm
        0.33795667 = fieldWeight in 1009, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.369764 = idf(docFreq=1520, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1009)
  0.2 = coord(1/5)
```
Abstract

This paper describes an extensible, open-source (GPL) data repository and retrieval system that supports fast, efficient, keyword based retrieval of genomic sequences from multiple libraries with retrieved sequences post-processed by FASTA, Smith-Waterman and other analysis software. This application is implemented for Linux and is written in Mumps, C, and C++ with supporting components that include the Berkeley Data Base, the Perl Compatible Regular Expression Library, GLADE, and tools such as FASTA, Smith-Waterman, and modules from EMBOSS. The package described here can quickly index data sets of up to 256 terabytes using a B-tree based multi-dimensional data model. An example is presented that indexes the text of the full NCBI Genbank library.

O'Kane, K.C.: World Wide Web-based information storage and retrieval (1996) 0.01

0.009770754 = product of:
  0.04885377 = sum of:
    0.04885377 = weight(_text_:22 in 4737) [ClassicSimilarity], result of:
      0.04885377 = score(doc=4737,freq=2.0), product of:
        0.18038483 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.051511593 = queryNorm
        0.2708308 = fieldWeight in 4737, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4737)
  0.2 = coord(1/5)

Date: 1. 8.1996 22:13:07

Search (3 results, page 1 of 1)

Years

Themes