Search (3 results, page 1 of 1)

Efron, M.: Linear time series models for term weighting in information retrieval (2010) 0.00
```
5.0708273E-4 = product of:
  0.007606241 = sum of:
    0.005915991 = weight(_text_:in in 3688) [ClassicSimilarity], result of:
      0.005915991 = score(doc=3688,freq=10.0), product of:
        0.029340398 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021569785 = queryNorm
        0.20163295 = fieldWeight in 3688, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=3688)
    0.0016902501 = weight(_text_:s in 3688) [ClassicSimilarity], result of:
      0.0016902501 = score(doc=3688,freq=2.0), product of:
        0.023451481 = queryWeight, product of:
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.021569785 = queryNorm
        0.072074346 = fieldWeight in 3688, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.046875 = fieldNorm(doc=3688)
  0.06666667 = coord(2/30)
```
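The arithmetic in the explain output above can be re-derived by hand. The following is a minimal sketch, assuming the classic Lucene TF-IDF formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))), which are consistent with the idf values shown; the function names are illustrative and not part of any library.

```python
import math

# Assumed formulas: classic Lucene TF-IDF as used by ClassicSimilarity in
# versions that still report queryNorm and coord. Names are hypothetical.

def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq: float, doc_freq: int, max_docs: int,
               query_norm: float, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm                    # idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight

# Constants copied from the explain output for doc 3688.
MAX_DOCS = 44218
QUERY_NORM = 0.021569785
FIELD_NORM = 0.046875

score_in = term_score(10.0, 30841, MAX_DOCS, QUERY_NORM, FIELD_NORM)
score_s = term_score(2.0, 40523, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# coord(2/30): two of the thirty query clauses matched this document.
coord = 2 / 30
total = (score_in + score_s) * coord

print(f"in: {score_in:.9f}  s: {score_s:.9f}  total: {total:.6e}")
```

The printed values should agree with the explain output up to floating-point rounding, since Lucene computes these quantities in single precision.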
Abstract

Common measures of term importance in information retrieval (IR) rely on counts of term frequency; rare terms receive higher weight in document ranking than common terms receive. However, realistic scenarios yield additional information about terms in a collection. Of interest in this article is the temporal behavior of terms as a collection changes over time. We propose capturing each term's collection frequency at discrete time intervals over the lifespan of a corpus and analyzing the resulting time series. We hypothesize the collection frequency of a weakly discriminative term x at time t is predictable by a linear model of the term's prior observations. On the other hand, a linear time series model for a strong discriminator's collection frequency will yield a poor fit to the data. Operationalizing this hypothesis, we induce three time-based measures of term importance and test these against state-of-the-art term weighting models.
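The hypothesis in the abstract can be illustrated with a small sketch. It assumes a first-order autoregressive model fitted by ordinary least squares and uses R² as the goodness of fit; the abstract does not specify the three measures, so this is not the author's method, and the function name and the sample series are invented for illustration.

```python
def ar1_r_squared(series):
    """Fit x[t] = a + b * x[t-1] by least squares and return R^2 of the fit."""
    prev, curr = series[:-1], series[1:]
    n = len(prev)
    mean_prev = sum(prev) / n
    mean_curr = sum(curr) / n
    sxx = sum((p - mean_prev) ** 2 for p in prev)
    syy = sum((c - mean_curr) ** 2 for c in curr)
    sxy = sum((p - mean_prev) * (c - mean_curr) for p, c in zip(prev, curr))
    if syy == 0:
        return 1.0  # constant target: trivially predictable
    if sxx == 0:
        return 0.0  # constant predictor cannot explain a varying target
    slope = sxy / sxx
    intercept = mean_curr - slope * mean_prev
    ss_res = sum((c - (intercept + slope * p)) ** 2 for p, c in zip(prev, curr))
    return 1.0 - ss_res / syy

# Hypothetical collection frequencies at eight successive time intervals.
steady_term = [100, 110, 120, 130, 140, 150, 160, 170]  # grows with the corpus
bursty_term = [2, 0, 45, 1, 0, 3, 60, 0]                # appears in bursts

r2_steady = ar1_r_squared(steady_term)
r2_bursty = ar1_r_squared(bursty_term)
print(f"steady: {r2_steady:.3f}  bursty: {r2_bursty:.3f}")
```

Under the hypothesis, a term whose series is well fit by the linear model (high R²) would be treated as weakly discriminative, and a poorly fit term as a stronger discriminator.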

Source

Journal of the American Society for Information Science and Technology. 61(2010) no.7, S.1299-1312

Efron, M.: Query expansion and dimensionality reduction : Notions of optimality in Rocchio relevance feedback and latent semantic indexing (2008) 0.00
```
4.6544487E-4 = product of:
  0.0069816727 = sum of:
    0.0052914224 = weight(_text_:in in 2020) [ClassicSimilarity], result of:
      0.0052914224 = score(doc=2020,freq=8.0), product of:
        0.029340398 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021569785 = queryNorm
        0.18034597 = fieldWeight in 2020, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=2020)
    0.0016902501 = weight(_text_:s in 2020) [ClassicSimilarity], result of:
      0.0016902501 = score(doc=2020,freq=2.0), product of:
        0.023451481 = queryWeight, product of:
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.021569785 = queryNorm
        0.072074346 = fieldWeight in 2020, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.046875 = fieldNorm(doc=2020)
  0.06666667 = coord(2/30)
```
Abstract

Rocchio relevance feedback and latent semantic indexing (LSI) are well-known extensions of the vector space model for information retrieval (IR). This paper analyzes the statistical relationship between these extensions. The analysis focuses on each method's basis in least-squares optimization. Noting that LSI and Rocchio relevance feedback both alter the vector space model in a way that is in some sense least-squares optimal, we ask: what is the relationship between LSI's and Rocchio's notions of optimality? What does this relationship imply for IR? Using an analytical approach, we argue that Rocchio relevance feedback is optimal if we understand retrieval as a simplified classification problem. On the other hand, LSI's motivation comes to the fore if we understand it as a biased regression technique, where projection onto a low-dimensional orthogonal subspace of the documents reduces model variance.
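For readers unfamiliar with Rocchio relevance feedback, the textbook update rule can be sketched as follows. This is the standard formulation, not the paper's analysis; the weights alpha, beta, and gamma and the toy vectors are illustrative assumptions.

```python
def centroid(vectors, dim):
    """Component-wise mean of a list of vectors (zero vector if the list is empty)."""
    if not vectors:
        return [0.0] * dim
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]

def rocchio(query, relevant, nonrelevant, alpha=1.0, beta=0.75, gamma=0.15):
    """Textbook Rocchio update:
    q' = alpha * q + beta * mean(relevant) - gamma * mean(nonrelevant).
    """
    dim = len(query)
    rel_c = centroid(relevant, dim)
    non_c = centroid(nonrelevant, dim)
    return [alpha * q + beta * r - gamma * s
            for q, r, s in zip(query, rel_c, non_c)]

# Toy term-weight vectors over a three-term vocabulary (hypothetical data).
query = [1.0, 0.0, 0.0]
relevant_docs = [[1.0, 1.0, 0.0], [1.0, 1.0, 0.0]]
nonrelevant_docs = [[0.0, 0.0, 1.0]]

expanded = rocchio(query, relevant_docs, nonrelevant_docs)
print(expanded)
```

The update moves the query toward the centroid of the relevant documents and away from the centroid of the non-relevant ones. Many implementations additionally clip negative weights to zero, which this sketch omits.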

Source

Information processing and management. 44(2008) no.1, S.163-180

Efron, M.; Winget, M.: Query polyrepresentation for ranking retrieval systems without relevance judgments (2010) 0.00
```
5.6341672E-5 = product of:
  0.0016902501 = sum of:
    0.0016902501 = weight(_text_:s in 3469) [ClassicSimilarity], result of:
      0.0016902501 = score(doc=3469,freq=2.0), product of:
        0.023451481 = queryWeight, product of:
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.021569785 = queryNorm
        0.072074346 = fieldWeight in 3469, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.046875 = fieldNorm(doc=3469)
  0.033333335 = coord(1/30)
```

Source

Journal of the American Society for Information Science and Technology. 61(2010) no.6, S.1081-1091
