Search (4 results, page 1 of 1)

  • × theme_ss:"Semantisches Umfeld in Indexierung u. Retrieval"
  1. Gödert, W.: Inhaltliche Dokumenterschließung, Information Retrieval und Navigation in Informationsräumen (1995) 0.10
    0.09869291 = product of:
      0.19738582 = sum of:
        0.112538084 = weight(_text_:storage in 4438) [ClassicSimilarity], result of:
          0.112538084 = score(doc=4438,freq=2.0), product of:
            0.23366846 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04288404 = queryNorm
            0.48161435 = fieldWeight in 4438, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.0625 = fieldNorm(doc=4438)
        0.049049217 = weight(_text_:retrieval in 4438) [ClassicSimilarity], result of:
          0.049049217 = score(doc=4438,freq=4.0), product of:
            0.12972058 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04288404 = queryNorm
            0.37811437 = fieldWeight in 4438, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0625 = fieldNorm(doc=4438)
        0.03579852 = weight(_text_:systems in 4438) [ClassicSimilarity], result of:
          0.03579852 = score(doc=4438,freq=2.0), product of:
            0.13179013 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04288404 = queryNorm
            0.2716328 = fieldWeight in 4438, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.0625 = fieldNorm(doc=4438)
      0.5 = coord(3/6)
    
    Abstract
    Examines the advantages and disadvantages of precoordinated, postcoordinated and automatic indexing with regard to existing information storage systems, such as card catalogues, OPACs, CR-ROM databases, and online databases. Presents a general model of document content representation and concludes that the library profession needs to address the development of databank design models, relevance feedback methods and automatic indexing assessment methods, to make indexing more effective
    Theme
    Semantisches Umfeld in Indexierung u. Retrieval
  2. Baofu, P.: ¬The future of information architecture : conceiving a better way to understand taxonomy, network, and intelligence (2008) 0.09
    0.08979165 = product of:
      0.1795833 = sum of:
        0.099470556 = weight(_text_:storage in 2257) [ClassicSimilarity], result of:
          0.099470556 = score(doc=2257,freq=4.0), product of:
            0.23366846 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04288404 = queryNorm
            0.42569098 = fieldWeight in 2257, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2257)
        0.048471015 = weight(_text_:retrieval in 2257) [ClassicSimilarity], result of:
          0.048471015 = score(doc=2257,freq=10.0), product of:
            0.12972058 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04288404 = queryNorm
            0.37365708 = fieldWeight in 2257, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2257)
        0.03164172 = weight(_text_:systems in 2257) [ClassicSimilarity], result of:
          0.03164172 = score(doc=2257,freq=4.0), product of:
            0.13179013 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04288404 = queryNorm
            0.24009174 = fieldWeight in 2257, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2257)
      0.5 = coord(3/6)
    
    LCSH
    Information storage and retrieval systems
    RSWK
    Suchmaschine / Information Retrieval
    Subject
    Information storage and retrieval systems
    Suchmaschine / Information Retrieval
    Theme
    Semantisches Umfeld in Indexierung u. Retrieval
  3. Morato, J.; Llorens, J.; Genova, G.; Moreiro, J.A.: Experiments in discourse analysis impact on information classification and retrieval algorithms (2003) 0.07
    0.068032086 = product of:
      0.13606417 = sum of:
        0.070336305 = weight(_text_:storage in 1083) [ClassicSimilarity], result of:
          0.070336305 = score(doc=1083,freq=2.0), product of:
            0.23366846 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04288404 = queryNorm
            0.30100897 = fieldWeight in 1083, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1083)
        0.043353792 = weight(_text_:retrieval in 1083) [ClassicSimilarity], result of:
          0.043353792 = score(doc=1083,freq=8.0), product of:
            0.12972058 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04288404 = queryNorm
            0.33420905 = fieldWeight in 1083, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1083)
        0.022374075 = weight(_text_:systems in 1083) [ClassicSimilarity], result of:
          0.022374075 = score(doc=1083,freq=2.0), product of:
            0.13179013 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04288404 = queryNorm
            0.1697705 = fieldWeight in 1083, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1083)
      0.5 = coord(3/6)
    
    Abstract
    Researchers in indexing and retrieval systems have been advocating the inclusion of more contextual information to improve results. The proliferation of full-text databases and advances in computer storage capacity have made it possible to carry out text analysis by means of linguistic and extra-linguistic knowledge. Since the mid 80s, research has tended to pay more attention to context, giving discourse analysis a more central role. The research presented in this paper aims to check whether discourse variables have an impact on modern information retrieval and classification algorithms. In order to evaluate this hypothesis, a functional framework for information analysis in an automated environment has been proposed, where the n-grams (filtering) and the k-means and Chen's classification algorithms have been tested against sub-collections of documents based on the following discourse variables: "Genre", "Register", "Domain terminology", and "Document structure". The results obtained with the algorithms for the different sub-collections were compared to the MeSH information structure. These demonstrate that n-grams does not appear to have a clear dependence on discourse variables, though the k-means classification algorithm does, but only on domain terminology and document structure, and finally Chen's algorithm has a clear dependence on all of the discourse variables. This information could be used to design better classification algorithms, where discourse variables should be taken into account. Other minor conclusions drawn from these results are also presented.
    Theme
    Semantisches Umfeld in Indexierung u. Retrieval
  4. Gao, J.; Zhang, J.: Clustered SVD strategies in latent semantic indexing (2005) 0.06
    0.055443417 = product of:
      0.16633025 = sum of:
        0.09847082 = weight(_text_:storage in 1166) [ClassicSimilarity], result of:
          0.09847082 = score(doc=1166,freq=2.0), product of:
            0.23366846 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04288404 = queryNorm
            0.42141256 = fieldWeight in 1166, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1166)
        0.06785942 = weight(_text_:retrieval in 1166) [ClassicSimilarity], result of:
          0.06785942 = score(doc=1166,freq=10.0), product of:
            0.12972058 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04288404 = queryNorm
            0.5231199 = fieldWeight in 1166, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1166)
      0.33333334 = coord(2/6)
    
    Abstract
    The text retrieval method using latent semantic indexing (LSI) technique with truncated singular value decomposition (SVD) has been intensively studied in recent years. The SVD reduces the noise contained in the original representation of the term-document matrix and improves the information retrieval accuracy. Recent studies indicate that SVD is mostly useful for small homogeneous data collections. For large inhomogeneous datasets, the performance of the SVD based text retrieval technique may deteriorate. We propose to partition a large inhomogeneous dataset into several smaller ones with clustered structure, on which we apply the truncated SVD. Our experimental results show that the clustered SVD strategies may enhance the retrieval accuracy and reduce the computing and storage costs.
    Theme
    Semantisches Umfeld in Indexierung u. Retrieval