Search (30 results, page 1 of 2)

  • × theme_ss:"Retrievalalgorithmen"
  1. Zhang, W.; Yoshida, T.; Tang, X.: ¬A comparative study of TF*IDF, LSI and multi-words for text classification (2011) 0.02
    0.021207385 = product of:
      0.12724431 = sum of:
        0.12724431 = weight(_text_:themes in 1165) [ClassicSimilarity], result of:
          0.12724431 = score(doc=1165,freq=2.0), product of:
            0.29856348 = queryWeight, product of:
              6.429029 = idf(docFreq=193, maxDocs=44218)
              0.046439905 = queryNorm
            0.42618844 = fieldWeight in 1165, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              6.429029 = idf(docFreq=193, maxDocs=44218)
              0.046875 = fieldNorm(doc=1165)
      0.16666667 = coord(1/6)
    
    Abstract
    One of the main themes in text mining is text representation, which is fundamental and indispensable for text-based intellegent information processing. Generally, text representation inludes two tasks: indexing and weighting. This paper has comparatively studied TF*IDF, LSI and multi-word for text representation. We used a Chinese and an English document collection to respectively evaluate the three methods in information retreival and text categorization. Experimental results have demonstrated that in text categorization, LSI has better performance than other methods in both document collections. Also, LSI has produced the best performance in retrieving English documents. This outcome has shown that LSI has both favorable semantic and statistical quality and is different with the claim that LSI can not produce discriminative power for indexing.
  2. Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.02
    0.016778577 = product of:
      0.100671455 = sum of:
        0.100671455 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
          0.100671455 = score(doc=402,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.61904186 = fieldWeight in 402, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.125 = fieldNorm(doc=402)
      0.16666667 = coord(1/6)
    
    Source
    Information processing and management. 22(1986) no.6, S.465-476
  3. Smeaton, A.F.; Rijsbergen, C.J. van: ¬The retrieval effects of query expansion on a feedback document retrieval system (1983) 0.01
    0.014681254 = product of:
      0.08808752 = sum of:
        0.08808752 = weight(_text_:22 in 2134) [ClassicSimilarity], result of:
          0.08808752 = score(doc=2134,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.5416616 = fieldWeight in 2134, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=2134)
      0.16666667 = coord(1/6)
    
    Date
    30. 3.2001 13:32:22
  4. Back, J.: ¬An evaluation of relevancy ranking techniques used by Internet search engines (2000) 0.01
    0.014681254 = product of:
      0.08808752 = sum of:
        0.08808752 = weight(_text_:22 in 3445) [ClassicSimilarity], result of:
          0.08808752 = score(doc=3445,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.5416616 = fieldWeight in 3445, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=3445)
      0.16666667 = coord(1/6)
    
    Date
    25. 8.2005 17:42:22
  5. Fuhr, N.: Ranking-Experimente mit gewichteter Indexierung (1986) 0.01
    0.012583932 = product of:
      0.07550359 = sum of:
        0.07550359 = weight(_text_:22 in 58) [ClassicSimilarity], result of:
          0.07550359 = score(doc=58,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.46428138 = fieldWeight in 58, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=58)
      0.16666667 = coord(1/6)
    
    Date
    14. 6.2015 22:12:44
  6. Fuhr, N.: Rankingexperimente mit gewichteter Indexierung (1986) 0.01
    0.012583932 = product of:
      0.07550359 = sum of:
        0.07550359 = weight(_text_:22 in 2051) [ClassicSimilarity], result of:
          0.07550359 = score(doc=2051,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.46428138 = fieldWeight in 2051, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=2051)
      0.16666667 = coord(1/6)
    
    Date
    14. 6.2015 22:12:56
  7. MacFarlane, A.; Robertson, S.E.; McCann, J.A.: Parallel computing for passage retrieval (2004) 0.01
    0.008389289 = product of:
      0.050335728 = sum of:
        0.050335728 = weight(_text_:22 in 5108) [ClassicSimilarity], result of:
          0.050335728 = score(doc=5108,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.30952093 = fieldWeight in 5108, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=5108)
      0.16666667 = coord(1/6)
    
    Date
    20. 1.2007 18:30:22
  8. Faloutsos, C.: Signature files (1992) 0.01
    0.008389289 = product of:
      0.050335728 = sum of:
        0.050335728 = weight(_text_:22 in 3499) [ClassicSimilarity], result of:
          0.050335728 = score(doc=3499,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.30952093 = fieldWeight in 3499, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=3499)
      0.16666667 = coord(1/6)
    
    Date
    7. 5.1999 15:22:48
  9. Losada, D.E.; Barreiro, A.: Emebedding term similarity and inverse document frequency into a logical model of information retrieval (2003) 0.01
    0.008389289 = product of:
      0.050335728 = sum of:
        0.050335728 = weight(_text_:22 in 1422) [ClassicSimilarity], result of:
          0.050335728 = score(doc=1422,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.30952093 = fieldWeight in 1422, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=1422)
      0.16666667 = coord(1/6)
    
    Date
    22. 3.2003 19:27:23
  10. Bornmann, L.; Mutz, R.: From P100 to P100' : a new citation-rank approach (2014) 0.01
    0.008389289 = product of:
      0.050335728 = sum of:
        0.050335728 = weight(_text_:22 in 1431) [ClassicSimilarity], result of:
          0.050335728 = score(doc=1431,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.30952093 = fieldWeight in 1431, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=1431)
      0.16666667 = coord(1/6)
    
    Date
    22. 8.2014 17:05:18
  11. Tober, M.; Hennig, L.; Furch, D.: SEO Ranking-Faktoren und Rang-Korrelationen 2014 : Google Deutschland (2014) 0.01
    0.008389289 = product of:
      0.050335728 = sum of:
        0.050335728 = weight(_text_:22 in 1484) [ClassicSimilarity], result of:
          0.050335728 = score(doc=1484,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.30952093 = fieldWeight in 1484, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=1484)
      0.16666667 = coord(1/6)
    
    Date
    13. 9.2014 14:45:22
  12. Ravana, S.D.; Rajagopal, P.; Balakrishnan, V.: Ranking retrieval systems using pseudo relevance judgments (2015) 0.01
    0.0074151526 = product of:
      0.044490915 = sum of:
        0.044490915 = weight(_text_:22 in 2591) [ClassicSimilarity], result of:
          0.044490915 = score(doc=2591,freq=4.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.27358043 = fieldWeight in 2591, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2591)
      0.16666667 = coord(1/6)
    
    Date
    20. 1.2015 18:30:22
    18. 9.2018 18:22:56
  13. Chang, C.-H.; Hsu, C.-C.: Integrating query expansion and conceptual relevance feedback for personalized Web information retrieval (1998) 0.01
    0.007340627 = product of:
      0.04404376 = sum of:
        0.04404376 = weight(_text_:22 in 1319) [ClassicSimilarity], result of:
          0.04404376 = score(doc=1319,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.2708308 = fieldWeight in 1319, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1319)
      0.16666667 = coord(1/6)
    
    Date
    1. 8.1996 22:08:06
  14. Kanaeva, Z.: Ranking: Google und CiteSeer (2005) 0.01
    0.007340627 = product of:
      0.04404376 = sum of:
        0.04404376 = weight(_text_:22 in 3276) [ClassicSimilarity], result of:
          0.04404376 = score(doc=3276,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.2708308 = fieldWeight in 3276, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3276)
      0.16666667 = coord(1/6)
    
    Date
    20. 3.2005 16:23:22
  15. Joss, M.W.; Wszola, S.: ¬The engines that can : text search and retrieval software, their strategies, and vendors (1996) 0.01
    0.006291966 = product of:
      0.037751794 = sum of:
        0.037751794 = weight(_text_:22 in 5123) [ClassicSimilarity], result of:
          0.037751794 = score(doc=5123,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.23214069 = fieldWeight in 5123, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=5123)
      0.16666667 = coord(1/6)
    
    Date
    12. 9.1996 13:56:22
  16. Kelledy, F.; Smeaton, A.F.: Signature files and beyond (1996) 0.01
    0.006291966 = product of:
      0.037751794 = sum of:
        0.037751794 = weight(_text_:22 in 6973) [ClassicSimilarity], result of:
          0.037751794 = score(doc=6973,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.23214069 = fieldWeight in 6973, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=6973)
      0.16666667 = coord(1/6)
    
    Source
    Information retrieval: new systems and current research. Proceedings of the 16th Research Colloquium of the British Computer Society Information Retrieval Specialist Group, Drymen, Scotland, 22-23 Mar 94. Ed.: R. Leon
  17. Crestani, F.; Dominich, S.; Lalmas, M.; Rijsbergen, C.J.K. van: Mathematical, logical, and formal methods in information retrieval : an introduction to the special issue (2003) 0.01
    0.006291966 = product of:
      0.037751794 = sum of:
        0.037751794 = weight(_text_:22 in 1451) [ClassicSimilarity], result of:
          0.037751794 = score(doc=1451,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.23214069 = fieldWeight in 1451, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=1451)
      0.16666667 = coord(1/6)
    
    Date
    22. 3.2003 19:27:36
  18. Fan, W.; Fox, E.A.; Pathak, P.; Wu, H.: ¬The effects of fitness functions an genetic programming-based ranking discovery for Web search (2004) 0.01
    0.006291966 = product of:
      0.037751794 = sum of:
        0.037751794 = weight(_text_:22 in 2239) [ClassicSimilarity], result of:
          0.037751794 = score(doc=2239,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.23214069 = fieldWeight in 2239, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2239)
      0.16666667 = coord(1/6)
    
    Date
    31. 5.2004 19:22:06
  19. Furner, J.: ¬A unifying model of document relatedness for hybrid search engines (2003) 0.01
    0.006291966 = product of:
      0.037751794 = sum of:
        0.037751794 = weight(_text_:22 in 2717) [ClassicSimilarity], result of:
          0.037751794 = score(doc=2717,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.23214069 = fieldWeight in 2717, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2717)
      0.16666667 = coord(1/6)
    
    Date
    11. 9.2004 17:32:22
  20. Witschel, H.F.: Global term weights in distributed environments (2008) 0.01
    0.006291966 = product of:
      0.037751794 = sum of:
        0.037751794 = weight(_text_:22 in 2096) [ClassicSimilarity], result of:
          0.037751794 = score(doc=2096,freq=2.0), product of:
            0.16262463 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046439905 = queryNorm
            0.23214069 = fieldWeight in 2096, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2096)
      0.16666667 = coord(1/6)
    
    Date
    1. 8.2008 9:44:22