Search (319 results, page 1 of 16)

Bornmann, L.; Mutz, R.: From P100 to P100' : a new citation-rank approach (2014) 0.07

0.06715009 = product of:
  0.10072514 = sum of:
    0.01208026 = weight(_text_:in in 1431) [ClassicSimilarity], result of:
      0.01208026 = score(doc=1431,freq=4.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.17003182 = fieldWeight in 1431, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0625 = fieldNorm(doc=1431)
    0.08864488 = sum of:
      0.032032568 = weight(_text_:science in 1431) [ClassicSimilarity], result of:
        0.032032568 = score(doc=1431,freq=2.0), product of:
          0.1375819 = queryWeight, product of:
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.052230705 = queryNorm
          0.23282544 = fieldWeight in 1431, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.0625 = fieldNorm(doc=1431)
      0.056612313 = weight(_text_:22 in 1431) [ClassicSimilarity], result of:
        0.056612313 = score(doc=1431,freq=2.0), product of:
          0.18290302 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052230705 = queryNorm
          0.30952093 = fieldWeight in 1431, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0625 = fieldNorm(doc=1431)
  0.6666667 = coord(2/3)

Abstract: Properties of a percentile-based rating scale needed in bibliometrics are formulated. Based on these properties, P100 was recently introduced as a new citation-rank approach (Bornmann, Leydesdorff, & Wang, 2013). In this paper, we conceptualize P100 and propose an improvement which we call P100'. Advantages and disadvantages of citation-rank indicators are noted.
Date: 22. 8.2014 17:05:18
Source: Journal of the Association for Information Science and Technology. 65(2014) no.9, S.1939-1943

Losada, D.E.; Barreiro, A.: Emebedding term similarity and inverse document frequency into a logical model of information retrieval (2003) 0.06

0.06479128 = product of:
  0.09718691 = sum of:
    0.008542033 = weight(_text_:in in 1422) [ClassicSimilarity], result of:
      0.008542033 = score(doc=1422,freq=2.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.120230645 = fieldWeight in 1422, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0625 = fieldNorm(doc=1422)
    0.08864488 = sum of:
      0.032032568 = weight(_text_:science in 1422) [ClassicSimilarity], result of:
        0.032032568 = score(doc=1422,freq=2.0), product of:
          0.1375819 = queryWeight, product of:
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.052230705 = queryNorm
          0.23282544 = fieldWeight in 1422, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.0625 = fieldNorm(doc=1422)
      0.056612313 = weight(_text_:22 in 1422) [ClassicSimilarity], result of:
        0.056612313 = score(doc=1422,freq=2.0), product of:
          0.18290302 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052230705 = queryNorm
          0.30952093 = fieldWeight in 1422, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0625 = fieldNorm(doc=1422)
  0.6666667 = coord(2/3)

Date: 22. 3.2003 19:27:23
Footnote: Beitrag eines Themenheftes: Mathematical, logical, and formal methods in information retrieval
Source: Journal of the American Society for Information Science and technology. 54(2003) no.4, S.285-301

Klas, C.-P.; Fuhr, N.; Schaefer, A.: Evaluating strategic support for information access in the DAFFODIL system (2004) 0.05

0.053872727 = product of:
  0.08080909 = sum of:
    0.014325427 = weight(_text_:in in 2419) [ClassicSimilarity], result of:
      0.014325427 = score(doc=2419,freq=10.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.20163295 = fieldWeight in 2419, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=2419)
    0.06648366 = sum of:
      0.024024425 = weight(_text_:science in 2419) [ClassicSimilarity], result of:
        0.024024425 = score(doc=2419,freq=2.0), product of:
          0.1375819 = queryWeight, product of:
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.052230705 = queryNorm
          0.17461908 = fieldWeight in 2419, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.046875 = fieldNorm(doc=2419)
      0.042459235 = weight(_text_:22 in 2419) [ClassicSimilarity], result of:
        0.042459235 = score(doc=2419,freq=2.0), product of:
          0.18290302 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052230705 = queryNorm
          0.23214069 = fieldWeight in 2419, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=2419)
  0.6666667 = coord(2/3)

Abstract: The digital library system Daffodil is targeted at strategic support of users during the information search process. For searching, exploring and managing digital library objects it provides user-customisable information seeking patterns over a federation of heterogeneous digital libraries. In this paper evaluation results with respect to retrieval effectiveness, efficiency and user satisfaction are presented. The analysis focuses on strategic support for the scientific work-flow. Daffodil supports the whole work-flow, from data source selection over information seeking to the representation, organisation and reuse of information. By embedding high level search functionality into the scientific work-flow, the user experiences better strategic system support due to a more systematic work process. These ideas have been implemented in Daffodil followed by a qualitative evaluation. The evaluation has been conducted with 28 participants, ranging from information seeking novices to experts. The results are promising, as they support the chosen model.
Date: 16.11.2008 16:22:48
Series: Lecture notes in computer science; vol.3232
Theme: Semantisches Umfeld in Indexierung u. Retrieval

Campos, L.M. de; Fernández-Luna, J.M.; Huete, J.F.: Implementing relevance feedback in the Bayesian network retrieval model (2003) 0.05

0.053872727 = product of:
  0.08080909 = sum of:
    0.014325427 = weight(_text_:in in 825) [ClassicSimilarity], result of:
      0.014325427 = score(doc=825,freq=10.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.20163295 = fieldWeight in 825, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=825)
    0.06648366 = sum of:
      0.024024425 = weight(_text_:science in 825) [ClassicSimilarity], result of:
        0.024024425 = score(doc=825,freq=2.0), product of:
          0.1375819 = queryWeight, product of:
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.052230705 = queryNorm
          0.17461908 = fieldWeight in 825, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.046875 = fieldNorm(doc=825)
      0.042459235 = weight(_text_:22 in 825) [ClassicSimilarity], result of:
        0.042459235 = score(doc=825,freq=2.0), product of:
          0.18290302 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052230705 = queryNorm
          0.23214069 = fieldWeight in 825, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=825)
  0.6666667 = coord(2/3)

Abstract: Relevance Feedback consists in automatically formulating a new query according to the relevance judgments provided by the user after evaluating a set of retrieved documents. In this article, we introduce several relevance feedback methods for the Bayesian Network Retrieval ModeL The theoretical frame an which our methods are based uses the concept of partial evidences, which summarize the new pieces of information gathered after evaluating the results obtained by the original query. These partial evidences are inserted into the underlying Bayesian network and a new inference process (probabilities propagation) is run to compute the posterior relevance probabilities of the documents in the collection given the new query. The quality of the proposed methods is tested using a preliminary experimentation with different standard document collections.
Date: 22. 3.2003 19:30:19
Footnote: Beitrag eines Themenheftes: Mathematical, logical, and formal methods in information retrieval
Source: Journal of the American Society for Information Science and technology. 54(2003) no.4, S.302-313

Baloh, P.; Desouza, K.C.; Hackney, R.: Contextualizing organizational interventions of knowledge management systems : a design science perspectiveA domain analysis (2012) 0.05
```
0.053824343 = product of:
  0.08073651 = sum of:
    0.010677542 = weight(_text_:in in 241) [ClassicSimilarity], result of:
      0.010677542 = score(doc=241,freq=8.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.15028831 = fieldWeight in 241, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=241)
    0.07005897 = sum of:
      0.034676272 = weight(_text_:science in 241) [ClassicSimilarity], result of:
        0.034676272 = score(doc=241,freq=6.0), product of:
          0.1375819 = queryWeight, product of:
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.052230705 = queryNorm
          0.25204095 = fieldWeight in 241, product of:
            2.4494898 = tf(freq=6.0), with freq of:
              6.0 = termFreq=6.0
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.0390625 = fieldNorm(doc=241)
      0.035382695 = weight(_text_:22 in 241) [ClassicSimilarity], result of:
        0.035382695 = score(doc=241,freq=2.0), product of:
          0.18290302 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052230705 = queryNorm
          0.19345059 = fieldWeight in 241, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=241)
  0.6666667 = coord(2/3)
```
Abstract

We address how individuals' (workers) knowledge needs influence the design of knowledge management systems (KMS), enabling knowledge creation and utilization. It is evident that KMS technologies and activities are indiscriminately deployed in most organizations with little regard to the actual context of their adoption. Moreover, it is apparent that the extant literature pertaining to knowledge management projects is frequently deficient in identifying the variety of factors indicative for successful KMS. This presents an obvious business practice and research gap that requires a critical analysis of the necessary intervention that will actually improve how workers can leverage and form organization-wide knowledge. This research involved an extensive review of the literature, a grounded theory methodological approach and rigorous data collection and synthesis through an empirical case analysis (Parsons Brinckerhoff and Samsung). The contribution of this study is the formulation of a model for designing KMS based upon the design science paradigm, which aspires to create artifacts that are interdependent of people and organizations. The essential proposition is that KMS design and implementation must be contextualized in relation to knowledge needs and that these will differ for various organizational settings. The findings present valuable insights and further understanding of the way in which KMS design efforts should be focused.

Date

11. 6.2012 14:22:34

Source

Journal of the American Society for Information Science and Technology. 63(2012) no.5, S.948-966

Crestani, F.; Dominich, S.; Lalmas, M.; Rijsbergen, C.J.K. van: Mathematical, logical, and formal methods in information retrieval : an introduction to the special issue (2003) 0.05

0.052864473 = product of:
  0.07929671 = sum of:
    0.012813049 = weight(_text_:in in 1451) [ClassicSimilarity], result of:
      0.012813049 = score(doc=1451,freq=8.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.18034597 = fieldWeight in 1451, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=1451)
    0.06648366 = sum of:
      0.024024425 = weight(_text_:science in 1451) [ClassicSimilarity], result of:
        0.024024425 = score(doc=1451,freq=2.0), product of:
          0.1375819 = queryWeight, product of:
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.052230705 = queryNorm
          0.17461908 = fieldWeight in 1451, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.046875 = fieldNorm(doc=1451)
      0.042459235 = weight(_text_:22 in 1451) [ClassicSimilarity], result of:
        0.042459235 = score(doc=1451,freq=2.0), product of:
          0.18290302 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052230705 = queryNorm
          0.23214069 = fieldWeight in 1451, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=1451)
  0.6666667 = coord(2/3)

Abstract: Research an the use of mathematical, logical, and formal methods, has been central to Information Retrieval research for a long time. Research in this area is important not only because it helps enhancing retrieval effectiveness, but also because it helps clarifying the underlying concepts of Information Retrieval. In this article we outline some of the major aspects of the subject, and summarize the papers of this special issue with respect to how they relate to these aspects. We conclude by highlighting some directions of future research, which are needed to better understand the formal characteristics of Information Retrieval.
Date: 22. 3.2003 19:27:36
Footnote: Einführung zu den Beiträgen eines Themenheftes: Mathematical, logical, and formal methods in information retrieval
Source: Journal of the American Society for Information Science and technology. 54(2003) no.4, S.281-284

Fan, W.; Fox, E.A.; Pathak, P.; Wu, H.: ¬The effects of fitness functions an genetic programming-based ranking discovery for Web search (2004) 0.05

0.052864473 = product of:
  0.07929671 = sum of:
    0.012813049 = weight(_text_:in in 2239) [ClassicSimilarity], result of:
      0.012813049 = score(doc=2239,freq=8.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.18034597 = fieldWeight in 2239, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=2239)
    0.06648366 = sum of:
      0.024024425 = weight(_text_:science in 2239) [ClassicSimilarity], result of:
        0.024024425 = score(doc=2239,freq=2.0), product of:
          0.1375819 = queryWeight, product of:
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.052230705 = queryNorm
          0.17461908 = fieldWeight in 2239, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.046875 = fieldNorm(doc=2239)
      0.042459235 = weight(_text_:22 in 2239) [ClassicSimilarity], result of:
        0.042459235 = score(doc=2239,freq=2.0), product of:
          0.18290302 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052230705 = queryNorm
          0.23214069 = fieldWeight in 2239, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=2239)
  0.6666667 = coord(2/3)

Abstract: Genetic-based evolutionary learning algorithms, such as genetic algorithms (GAs) and genetic programming (GP), have been applied to information retrieval (IR) since the 1980s. Recently, GP has been applied to a new IR taskdiscovery of ranking functions for Web search-and has achieved very promising results. However, in our prior research, only one fitness function has been used for GP-based learning. It is unclear how other fitness functions may affect ranking function discovery for Web search, especially since it is weIl known that choosing a proper fitness function is very important for the effectiveness and efficiency of evolutionary algorithms. In this article, we report our experience in contrasting different fitness function designs an GP-based learning using a very large Web corpus. Our results indicate that the design of fitness functions is instrumental in performance improvement. We also give recommendations an the design of fitness functions for genetic-based information retrieval experiments.
Date: 31. 5.2004 19:22:06
Source: Journal of the American Society for Information Science and technology. 55(2004) no.7, S.628-636

Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.05

0.049130917 = product of:
  0.073696375 = sum of:
    0.017084066 = weight(_text_:in in 402) [ClassicSimilarity], result of:
      0.017084066 = score(doc=402,freq=2.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.24046129 = fieldWeight in 402, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.125 = fieldNorm(doc=402)
    0.056612313 = product of:
      0.113224626 = sum of:
        0.113224626 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
          0.113224626 = score(doc=402,freq=2.0), product of:
            0.18290302 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.052230705 = queryNorm
            0.61904186 = fieldWeight in 402, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.125 = fieldNorm(doc=402)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Source: Information processing and management. 22(1986) no.6, S.465-476

Dominich, S.: Mathematical foundations of information retrieval (2001) 0.05

0.04602302 = product of:
  0.069034524 = sum of:
    0.005338771 = weight(_text_:in in 1753) [ClassicSimilarity], result of:
      0.005338771 = score(doc=1753,freq=2.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.07514416 = fieldWeight in 1753, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1753)
    0.06369575 = sum of:
      0.028313057 = weight(_text_:science in 1753) [ClassicSimilarity], result of:
        0.028313057 = score(doc=1753,freq=4.0), product of:
          0.1375819 = queryWeight, product of:
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.052230705 = queryNorm
          0.20579056 = fieldWeight in 1753, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1753)
      0.035382695 = weight(_text_:22 in 1753) [ClassicSimilarity], result of:
        0.035382695 = score(doc=1753,freq=2.0), product of:
          0.18290302 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052230705 = queryNorm
          0.19345059 = fieldWeight in 1753, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1753)
  0.6666667 = coord(2/3)

Abstract: This book offers a comprehensive and consistent mathematical approach to information retrieval (IR) without which no implementation is possible, and sheds an entirely new light upon the structure of IR models. It contains the descriptions of all IR models in a unified formal style and language, along with examples for each, thus offering a comprehensive overview of them. The book also creates mathematical foundations and a consistent mathematical theory (including all mathematical results achieved so far) of IR as a stand-alone mathematical discipline, which thus can be read and taught independently. Also, the book contains all necessary mathematical knowledge on which IR relies, to help the reader avoid searching different sources. The book will be of interest to computer or information scientists, librarians, mathematicians, undergraduate students and researchers whose work involves information retrieval.
Date: 22. 3.2008 12:26:32
LCSH: Computer science / Mathematics
Subject: Computer science / Mathematics

Song, D.; Bruza, P.D.: Towards context sensitive information inference (2003) 0.05
```
0.045653544 = product of:
  0.06848031 = sum of:
    0.013077264 = weight(_text_:in in 1428) [ClassicSimilarity], result of:
      0.013077264 = score(doc=1428,freq=12.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.18406484 = fieldWeight in 1428, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1428)
    0.05540305 = sum of:
      0.020020355 = weight(_text_:science in 1428) [ClassicSimilarity], result of:
        0.020020355 = score(doc=1428,freq=2.0), product of:
          0.1375819 = queryWeight, product of:
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.052230705 = queryNorm
          0.1455159 = fieldWeight in 1428, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1428)
      0.035382695 = weight(_text_:22 in 1428) [ClassicSimilarity], result of:
        0.035382695 = score(doc=1428,freq=2.0), product of:
          0.18290302 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052230705 = queryNorm
          0.19345059 = fieldWeight in 1428, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1428)
  0.6666667 = coord(2/3)
```
Abstract

Humans can make hasty, but generally robust judgements about what a text fragment is, or is not, about. Such judgements are termed information inference. This article furnishes an account of information inference from a psychologistic stance. By drawing an theories from nonclassical logic and applied cognition, an information inference mechanism is proposed that makes inferences via computations of information flow through an approximation of a conceptual space. Within a conceptual space information is represented geometrically. In this article, geometric representations of words are realized as vectors in a high dimensional semantic space, which is automatically constructed from a text corpus. Two approaches were presented for priming vector representations according to context. The first approach uses a concept combination heuristic to adjust the vector representation of a concept in the light of the representation of another concept. The second approach computes a prototypical concept an the basis of exemplar trace texts and moves it in the dimensional space according to the context. Information inference is evaluated by measuring the effectiveness of query models derived by information flow computations. Results show that information flow contributes significantly to query model effectiveness, particularly with respect to precision. Moreover, retrieval effectiveness compares favorably with two probabilistic query models, and another based an semantic association. More generally, this article can be seen as a contribution towards realizing operational systems that mimic text-based human reasoning.

Date

22. 3.2003 19:35:46

Footnote

Beitrag eines Themenheftes: Mathematical, logical, and formal methods in information retrieval

Source

Journal of the American Society for Information Science and technology. 54(2003) no.4, S.321-334

Theme

Semantisches Umfeld in Indexierung u. Retrieval
Shiri, A.A.; Revie, C.: Query expansion behavior within a thesaurus-enhanced search environment : a user-centered evaluation (2006) 0.04
```
0.044893935 = product of:
  0.0673409 = sum of:
    0.011937855 = weight(_text_:in in 56) [ClassicSimilarity], result of:
      0.011937855 = score(doc=56,freq=10.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.16802745 = fieldWeight in 56, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=56)
    0.05540305 = sum of:
      0.020020355 = weight(_text_:science in 56) [ClassicSimilarity], result of:
        0.020020355 = score(doc=56,freq=2.0), product of:
          0.1375819 = queryWeight, product of:
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.052230705 = queryNorm
          0.1455159 = fieldWeight in 56, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.0390625 = fieldNorm(doc=56)
      0.035382695 = weight(_text_:22 in 56) [ClassicSimilarity], result of:
        0.035382695 = score(doc=56,freq=2.0), product of:
          0.18290302 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052230705 = queryNorm
          0.19345059 = fieldWeight in 56, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=56)
  0.6666667 = coord(2/3)
```
Abstract

The study reported here investigated the query expansion behavior of end-users interacting with a thesaurus-enhanced search system on the Web. Two groups, namely academic staff and postgraduate students, were recruited into this study. Data were collected from 90 searches performed by 30 users using the OVID interface to the CAB abstracts database. Data-gathering techniques included questionnaires, screen capturing software, and interviews. The results presented here relate to issues of search-topic and search-term characteristics, number and types of expanded queries, usefulness of thesaurus terms, and behavioral differences between academic staff and postgraduate students in their interaction. The key conclusions drawn were that (a) academic staff chose more narrow and synonymous terms than did postgraduate students, who generally selected broader and related terms; (b) topic complexity affected users' interaction with the thesaurus in that complex topics required more query expansion and search term selection; (c) users' prior topic-search experience appeared to have a significant effect on their selection and evaluation of thesaurus terms; (d) in 50% of the searches where additional terms were suggested from the thesaurus, users stated that they had not been aware of the terms at the beginning of the search; this observation was particularly noticeable in the case of postgraduate students.

Date

22. 7.2006 16:32:43

Source

Journal of the American Society for Information Science and Technology. 57(2006) no.4, S.462-478

Theme

Semantisches Umfeld in Indexierung u. Retrieval

Smeaton, A.F.; Rijsbergen, C.J. van: ¬The retrieval effects of query expansion on a feedback document retrieval system (1983) 0.04

0.042989552 = product of:
  0.06448433 = sum of:
    0.014948557 = weight(_text_:in in 2134) [ClassicSimilarity], result of:
      0.014948557 = score(doc=2134,freq=2.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.21040362 = fieldWeight in 2134, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.109375 = fieldNorm(doc=2134)
    0.049535774 = product of:
      0.09907155 = sum of:
        0.09907155 = weight(_text_:22 in 2134) [ClassicSimilarity], result of:
          0.09907155 = score(doc=2134,freq=2.0), product of:
            0.18290302 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.052230705 = queryNorm
            0.5416616 = fieldWeight in 2134, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=2134)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Date: 30. 3.2001 13:32:22
Theme: Semantisches Umfeld in Indexierung u. Retrieval

Burgin, R.: ¬The retrieval effectiveness of 5 clustering algorithms as a function of indexing exhaustivity (1995) 0.04
```
0.041968808 = product of:
  0.06295321 = sum of:
    0.007550162 = weight(_text_:in in 3365) [ClassicSimilarity], result of:
      0.007550162 = score(doc=3365,freq=4.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.10626988 = fieldWeight in 3365, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3365)
    0.05540305 = sum of:
      0.020020355 = weight(_text_:science in 3365) [ClassicSimilarity], result of:
        0.020020355 = score(doc=3365,freq=2.0), product of:
          0.1375819 = queryWeight, product of:
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.052230705 = queryNorm
          0.1455159 = fieldWeight in 3365, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.0390625 = fieldNorm(doc=3365)
      0.035382695 = weight(_text_:22 in 3365) [ClassicSimilarity], result of:
        0.035382695 = score(doc=3365,freq=2.0), product of:
          0.18290302 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052230705 = queryNorm
          0.19345059 = fieldWeight in 3365, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=3365)
  0.6666667 = coord(2/3)
```
Abstract

The retrieval effectiveness of 5 hierarchical clustering methods (single link, complete link, group average, Ward's method, and weighted average) is examined as a function of indexing exhaustivity with 4 test collections (CR, Cranfield, Medlars, and Time). Evaluations of retrieval effectiveness, based on 3 measures of optimal retrieval performance, confirm earlier findings that the performance of a retrieval system based on single link clustering varies as a function of indexing exhaustivity but fail ti find similar patterns for other clustering methods. The data also confirm earlier findings regarding the poor performance of single link clustering is a retrieval environment. The poor performance of single link clustering appears to derive from that method's tendency to produce a small number of large, ill defined document clusters. By contrast, the data examined here found the retrieval performance of the other clustering methods to be general comparable. The data presented also provides an opportunity to examine the theoretical limits of cluster based retrieval and to compare these theoretical limits to the effectiveness of operational implementations. Performance standards of the 4 document collections examined were found to vary widely, and the effectiveness of operational implementations were found to be in the range defined as unacceptable. Further improvements in search strategies and document representations warrant investigations

Date

22. 2.1996 11:20:06

Source

Journal of the American Society for Information Science. 46(1995) no.8, S.562-572

Soulier, L.; Jabeur, L.B.; Tamine, L.; Bahsoun, W.: On ranking relevant entities in heterogeneous networks using a language-based model (2013) 0.04

0.041968808 = product of:
  0.06295321 = sum of:
    0.007550162 = weight(_text_:in in 664) [ClassicSimilarity], result of:
      0.007550162 = score(doc=664,freq=4.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.10626988 = fieldWeight in 664, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=664)
    0.05540305 = sum of:
      0.020020355 = weight(_text_:science in 664) [ClassicSimilarity], result of:
        0.020020355 = score(doc=664,freq=2.0), product of:
          0.1375819 = queryWeight, product of:
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.052230705 = queryNorm
          0.1455159 = fieldWeight in 664, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.0390625 = fieldNorm(doc=664)
      0.035382695 = weight(_text_:22 in 664) [ClassicSimilarity], result of:
        0.035382695 = score(doc=664,freq=2.0), product of:
          0.18290302 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052230705 = queryNorm
          0.19345059 = fieldWeight in 664, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=664)
  0.6666667 = coord(2/3)

Abstract: A new challenge, accessing multiple relevant entities, arises from the availability of linked heterogeneous data. In this article, we address more specifically the problem of accessing relevant entities, such as publications and authors within a bibliographic network, given an information need. We propose a novel algorithm, called BibRank, that estimates a joint relevance of documents and authors within a bibliographic network. This model ranks each type of entity using a score propagation algorithm with respect to the query topic and the structure of the underlying bi-type information entity network. Evidence sources, namely content-based and network-based scores, are both used to estimate the topical similarity between connected entities. For this purpose, authorship relationships are analyzed through a language model-based score on the one hand and on the other hand, non topically related entities of the same type are detected through marginal citations. The article reports the results of experiments using the Bibrank algorithm for an information retrieval task. The CiteSeerX bibliographic data set forms the basis for the topical query automatic generation and evaluation. We show that a statistically significant improvement over closely related ranking models is achieved.
Date: 22. 3.2013 19:34:49
Source: Journal of the American Society for Information Science and Technology. 64(2013) no.3, S.500-515

MacFarlane, A.; Robertson, S.E.; McCann, J.A.: Parallel computing for passage retrieval (2004) 0.03

0.028734263 = product of:
  0.043101393 = sum of:
    0.014795236 = weight(_text_:in in 5108) [ClassicSimilarity], result of:
      0.014795236 = score(doc=5108,freq=6.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.2082456 = fieldWeight in 5108, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0625 = fieldNorm(doc=5108)
    0.028306156 = product of:
      0.056612313 = sum of:
        0.056612313 = weight(_text_:22 in 5108) [ClassicSimilarity], result of:
          0.056612313 = score(doc=5108,freq=2.0), product of:
            0.18290302 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.052230705 = queryNorm
            0.30952093 = fieldWeight in 5108, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=5108)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: In this paper methods for both speeding up passage processing and examining more passages using parallel computers are explored. The number of passages processed are varied in order to examine the effect on retrieval effectiveness and efficiency. The particular algorithm applied has previously been used to good effect in Okapi experiments at TREC. This algorithm and the mechanism for applying parallel computing to speed up processing are described.
Date: 20. 1.2007 18:30:22

Perry, R.; Willett, P.: ¬A revies of the use of inverted files for best match searching in information retrieval systems (1983) 0.03

0.02865137 = product of:
  0.042977054 = sum of:
    0.014948557 = weight(_text_:in in 2701) [ClassicSimilarity], result of:
      0.014948557 = score(doc=2701,freq=2.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.21040362 = fieldWeight in 2701, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.109375 = fieldNorm(doc=2701)
    0.028028497 = product of:
      0.056056995 = sum of:
        0.056056995 = weight(_text_:science in 2701) [ClassicSimilarity], result of:
          0.056056995 = score(doc=2701,freq=2.0), product of:
            0.1375819 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.052230705 = queryNorm
            0.40744454 = fieldWeight in 2701, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.109375 = fieldNorm(doc=2701)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Source: Journal of information science. 6(1983), S.59-66

Ravana, S.D.; Rajagopal, P.; Balakrishnan, V.: Ranking retrieval systems using pseudo relevance judgments (2015) 0.03
```
0.025397738 = product of:
  0.038096607 = sum of:
    0.013077264 = weight(_text_:in in 2591) [ClassicSimilarity], result of:
      0.013077264 = score(doc=2591,freq=12.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.18406484 = fieldWeight in 2591, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2591)
    0.025019344 = product of:
      0.050038688 = sum of:
        0.050038688 = weight(_text_:22 in 2591) [ClassicSimilarity], result of:
          0.050038688 = score(doc=2591,freq=4.0), product of:
            0.18290302 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.052230705 = queryNorm
            0.27358043 = fieldWeight in 2591, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2591)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)
```
Abstract

Purpose In a system-based approach, replicating the web would require large test collections, and judging the relevancy of all documents per topic in creating relevance judgment through human assessors is infeasible. Due to the large amount of documents that requires judgment, there are possible errors introduced by human assessors because of disagreements. The paper aims to discuss these issues. Design/methodology/approach This study explores exponential variation and document ranking methods that generate a reliable set of relevance judgments (pseudo relevance judgments) to reduce human efforts. These methods overcome problems with large amounts of documents for judgment while avoiding human disagreement errors during the judgment process. This study utilizes two key factors: number of occurrences of each document per topic from all the system runs; and document rankings to generate the alternate methods. Findings The effectiveness of the proposed method is evaluated using the correlation coefficient of ranked systems using mean average precision scores between the original Text REtrieval Conference (TREC) relevance judgments and pseudo relevance judgments. The results suggest that the proposed document ranking method with a pool depth of 100 could be a reliable alternative to reduce human effort and disagreement errors involved in generating TREC-like relevance judgments. Originality/value Simple methods proposed in this study show improvement in the correlation coefficient in generating alternate relevance judgment without human assessors while contributing to information retrieval evaluation.

Date

20. 1.2015 18:30:22
18. 9.2018 18:22:56

Kanaeva, Z.: Ranking: Google und CiteSeer (2005) 0.03

0.02514248 = product of:
  0.037713718 = sum of:
    0.012945832 = weight(_text_:in in 3276) [ClassicSimilarity], result of:
      0.012945832 = score(doc=3276,freq=6.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.1822149 = fieldWeight in 3276, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0546875 = fieldNorm(doc=3276)
    0.024767887 = product of:
      0.049535774 = sum of:
        0.049535774 = weight(_text_:22 in 3276) [ClassicSimilarity], result of:
          0.049535774 = score(doc=3276,freq=2.0), product of:
            0.18290302 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.052230705 = queryNorm
            0.2708308 = fieldWeight in 3276, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3276)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: Im Rahmen des klassischen Information Retrieval wurden verschiedene Verfahren für das Ranking sowie die Suche in einer homogenen strukturlosen Dokumentenmenge entwickelt. Die Erfolge der Suchmaschine Google haben gezeigt dass die Suche in einer zwar inhomogenen aber zusammenhängenden Dokumentenmenge wie dem Internet unter Berücksichtigung der Dokumentenverbindungen (Links) sehr effektiv sein kann. Unter den von der Suchmaschine Google realisierten Konzepten ist ein Verfahren zum Ranking von Suchergebnissen (PageRank), das in diesem Artikel kurz erklärt wird. Darüber hinaus wird auf die Konzepte eines Systems namens CiteSeer eingegangen, welches automatisch bibliographische Angaben indexiert (engl. Autonomous Citation Indexing, ACI). Letzteres erzeugt aus einer Menge von nicht vernetzten wissenschaftlichen Dokumenten eine zusammenhängende Dokumentenmenge und ermöglicht den Einsatz von Banking-Verfahren, die auf den von Google genutzten Verfahren basieren.
Date: 20. 3.2005 16:23:22

Faloutsos, C.: Signature files (1992) 0.02

0.024565458 = product of:
  0.036848187 = sum of:
    0.008542033 = weight(_text_:in in 3499) [ClassicSimilarity], result of:
      0.008542033 = score(doc=3499,freq=2.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.120230645 = fieldWeight in 3499, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0625 = fieldNorm(doc=3499)
    0.028306156 = product of:
      0.056612313 = sum of:
        0.056612313 = weight(_text_:22 in 3499) [ClassicSimilarity], result of:
          0.056612313 = score(doc=3499,freq=2.0), product of:
            0.18290302 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.052230705 = queryNorm
            0.30952093 = fieldWeight in 3499, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=3499)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: Presents a survey and discussion on signature-based text retrieval methods. It describes the main idea behind the signature approach and its advantages over other text retrieval methods, it provides a classification of the signature methods that have appeared in the literature, it describes the main representatives of each class, together with the relative advantages and drawbacks, and it gives a list of applications as well as commercial or university prototypes that use the signature approach
Date: 7. 5.1999 15:22:48

Witschel, H.F.: Global term weights in distributed environments (2008) 0.02

0.023703363 = product of:
  0.035555042 = sum of:
    0.014325427 = weight(_text_:in in 2096) [ClassicSimilarity], result of:
      0.014325427 = score(doc=2096,freq=10.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.20163295 = fieldWeight in 2096, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=2096)
    0.021229617 = product of:
      0.042459235 = sum of:
        0.042459235 = weight(_text_:22 in 2096) [ClassicSimilarity], result of:
          0.042459235 = score(doc=2096,freq=2.0), product of:
            0.18290302 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.052230705 = queryNorm
            0.23214069 = fieldWeight in 2096, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2096)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: This paper examines the estimation of global term weights (such as IDF) in information retrieval scenarios where a global view on the collection is not available. In particular, the two options of either sampling documents or of using a reference corpus independent of the target retrieval collection are compared using standard IR test collections. In addition, the possibility of pruning term lists based on frequency is evaluated. The results show that very good retrieval performance can be reached when just the most frequent terms of a collection - an "extended stop word list" - are known and all terms which are not in that list are treated equally. However, the list cannot always be fully estimated from a general-purpose reference corpus, but some "domain-specific stop words" need to be added. A good solution for achieving this is to mix estimates from small samples of the target retrieval collection with ones derived from a reference corpus.
Date: 1. 8.2008 9:44:22

Search (319 results, page 1 of 16)

Authors

Years

Languages

Types

Themes

Subjects

Classifications