Search (3 results, page 1 of 1)

  • × theme_ss:"Automatisches Indexieren"
  • × theme_ss:"Internet"
  1. Thirion, B.; Leroy, J.P.; Baudic, F.; Douyère, M.; Piot, J.; Darmoni, S.J.: SDI selecting, decribing, and indexing : did you mean automatically? (2001) 0.02
    0.017572623 = product of:
      0.061504178 = sum of:
        0.041002784 = weight(_text_:j in 6198) [ClassicSimilarity], result of:
          0.041002784 = score(doc=6198,freq=2.0), product of:
            0.09732894 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.030630698 = queryNorm
            0.4212805 = fieldWeight in 6198, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.09375 = fieldNorm(doc=6198)
        0.020501392 = product of:
          0.041002784 = sum of:
            0.041002784 = weight(_text_:j in 6198) [ClassicSimilarity], result of:
              0.041002784 = score(doc=6198,freq=2.0), product of:
                0.09732894 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.030630698 = queryNorm
                0.4212805 = fieldWeight in 6198, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.09375 = fieldNorm(doc=6198)
          0.5 = coord(1/2)
      0.2857143 = coord(2/7)
    
  2. Rasmussen, E.M.: Indexing and retrieval for the Web (2002) 0.00
    0.00429168 = product of:
      0.03004176 = sum of:
        0.03004176 = product of:
          0.06008352 = sum of:
            0.06008352 = weight(_text_:huang in 4285) [ClassicSimilarity], result of:
              0.06008352 = score(doc=4285,freq=2.0), product of:
                0.21815723 = queryWeight, product of:
                  7.122176 = idf(docFreq=96, maxDocs=44218)
                  0.030630698 = queryNorm
                0.27541384 = fieldWeight in 4285, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  7.122176 = idf(docFreq=96, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=4285)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Abstract
    Techniques for automated indexing and information retrieval (IR) have been developed, tested, and refined over the past 40 years, and are well documented (see, for example, Agosti & Smeaton, 1996; BaezaYates & Ribeiro-Neto, 1999a; Frakes & Baeza-Yates, 1992; Korfhage, 1997; Salton, 1989; Witten, Moffat, & Bell, 1999). With the introduction of the Web, and the capability to index and retrieve via search engines, these techniques have been extended to a new environment. They have been adopted, altered, and in some Gases extended to include new methods. "In short, search engines are indispensable for searching the Web, they employ a variety of relatively advanced IR techniques, and there are some peculiar aspects of search engines that make searching the Web different than more conventional information retrieval" (Gordon & Pathak, 1999, p. 145). The environment for information retrieval an the World Wide Web differs from that of "conventional" information retrieval in a number of fundamental ways. The collection is very large and changes continuously, with pages being added, deleted, and altered. Wide variability between the size, structure, focus, quality, and usefulness of documents makes Web documents much more heterogeneous than a typical electronic document collection. The wide variety of document types includes images, video, audio, and scripts, as well as many different document languages. Duplication of documents and sites is common. Documents are interconnected through networks of hyperlinks. Because of the size and dynamic nature of the Web, preprocessing all documents requires considerable resources and is often not feasible, certainly not an the frequent basis required to ensure currency. Query length is usually much shorter than in other environments-only a few words-and user behavior differs from that in other environments. These differences make the Web a novel environment for information retrieval (Baeza-Yates & Ribeiro-Neto, 1999b; Bharat & Henzinger, 1998; Huang, 2000).
  3. Wolfekuhler, M.R.; Punch, W.F.: Finding salient features for personal Web pages categories (1997) 0.00
    0.0020750186 = product of:
      0.014525129 = sum of:
        0.014525129 = product of:
          0.029050259 = sum of:
            0.029050259 = weight(_text_:22 in 2673) [ClassicSimilarity], result of:
              0.029050259 = score(doc=2673,freq=2.0), product of:
                0.10726349 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.030630698 = queryNorm
                0.2708308 = fieldWeight in 2673, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2673)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Date
    1. 8.1996 22:08:06