Document (#32021)

Author
Sherman, C.
Price, G.
Title
¬The invisible Web : uncovering sources search engines can't see
Source
Library trends. 52(2004) no.2, S.282-298
Year
2004
Abstract
The paradox of the Invisible Web is that it's easy to understand why it exists, but it's very hard to actually define in concrete, specific terms. In a nutshell, the Invisible Web consists of content that's been excluded from general-purpose search engines and Web directories such as Lycos and LookSmart-and yes, even Google. There's nothing inherently "invisible" about this content. But since this content is not easily located with the information-seeking tools used by most Web users, it's effectively invisible because it's so difficult to find unless you know exactly where to look. In this paper, we define the Invisible Web and delve into the reasons search engines can't "see" its content. We also discuss the four different "types" of invisibility, ranging from the "opaque" Web which is relatively accessible to the searcher, to the truly invisible Web, which requires specialized finding aids to access effectively.
Footnote
Beitrag in einem Themenheft: Organizing the Internet
Theme
Internet

Similar documents (author)

  1. Sherman, C.; Price, G.: ¬The invisible Web : uncovering information sources search engines can't see (2001) 6.26
    6.2580733 = sum of:
      6.2580733 = sum of:
        2.6560621 = weight(author_txt:price in 62) [ClassicSimilarity], result of:
          2.6560621 = score(doc=62,freq=1.0), product of:
            0.63231665 = queryWeight, product of:
              8.401051 = idf(docFreq=26, maxDocs=44218)
              0.075266376 = queryNorm
            4.2005253 = fieldWeight in 62, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.401051 = idf(docFreq=26, maxDocs=44218)
              0.5 = fieldNorm(doc=62)
        3.6020112 = weight(author_txt:sherman in 62) [ClassicSimilarity], result of:
          3.6020112 = score(doc=62,freq=1.0), product of:
            0.77471 = queryWeight, product of:
              1.1068845 = boost
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.075266376 = queryNorm
            4.649496 = fieldWeight in 62, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.5 = fieldNorm(doc=62)
    
  2. Sherman, C.R.: ICONCLASS: a historical perspective (1987) 2.25
    2.251257 = sum of:
      2.251257 = product of:
        4.502514 = sum of:
          4.502514 = weight(author_txt:sherman in 3269) [ClassicSimilarity], result of:
            4.502514 = score(doc=3269,freq=1.0), product of:
              0.77471 = queryWeight, product of:
                1.1068845 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.075266376 = queryNorm
              5.81187 = fieldWeight in 3269, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.625 = fieldNorm(doc=3269)
        0.5 = coord(1/2)
    
  3. Sherman, R.J.: ¬The electronic book (1993) 2.25
    2.251257 = sum of:
      2.251257 = product of:
        4.502514 = sum of:
          4.502514 = weight(author_txt:sherman in 4568) [ClassicSimilarity], result of:
            4.502514 = score(doc=4568,freq=1.0), product of:
              0.77471 = queryWeight, product of:
                1.1068845 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.075266376 = queryNorm
              5.81187 = fieldWeight in 4568, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.625 = fieldNorm(doc=4568)
        0.5 = coord(1/2)
    
  4. Sherman, C.: What's new with Web search (2000) 2.25
    2.251257 = sum of:
      2.251257 = product of:
        4.502514 = sum of:
          4.502514 = weight(author_txt:sherman in 4795) [ClassicSimilarity], result of:
            4.502514 = score(doc=4795,freq=1.0), product of:
              0.77471 = queryWeight, product of:
                1.1068845 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.075266376 = queryNorm
              5.81187 = fieldWeight in 4795, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.625 = fieldNorm(doc=4795)
        0.5 = coord(1/2)
    
  5. Sherman, C.: ¬The future of Web search (1999) 2.25
    2.251257 = sum of:
      2.251257 = product of:
        4.502514 = sum of:
          4.502514 = weight(author_txt:sherman in 4796) [ClassicSimilarity], result of:
            4.502514 = score(doc=4796,freq=1.0), product of:
              0.77471 = queryWeight, product of:
                1.1068845 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.075266376 = queryNorm
              5.81187 = fieldWeight in 4796, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.625 = fieldNorm(doc=4796)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Lewandowski, D.; Mayr, P.: Exploring the academic invisible Web (2006) 0.26
    0.26463538 = sum of:
      0.26463538 = product of:
        1.1026474 = sum of:
          0.006201785 = weight(abstract_txt:this in 2580) [ClassicSimilarity], result of:
            0.006201785 = score(doc=2580,freq=2.0), product of:
              0.02907778 = queryWeight, product of:
                1.0438864 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.011543767 = queryNorm
              0.21328263 = fieldWeight in 2580, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=2580)
          0.06631007 = weight(abstract_txt:uncovering in 2580) [ClassicSimilarity], result of:
            0.06631007 = score(doc=2580,freq=1.0), product of:
              0.123283796 = queryWeight, product of:
                1.2409805 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.011543767 = queryNorm
              0.5378653 = fieldWeight in 2580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.0625 = fieldNorm(doc=2580)
          0.026462492 = weight(abstract_txt:search in 2580) [ClassicSimilarity], result of:
            0.026462492 = score(doc=2580,freq=3.0), product of:
              0.06682518 = queryWeight, product of:
                1.5824962 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.011543767 = queryNorm
              0.3959958 = fieldWeight in 2580, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=2580)
          0.04702329 = weight(abstract_txt:define in 2580) [ClassicSimilarity], result of:
            0.04702329 = score(doc=2580,freq=1.0), product of:
              0.12352029 = queryWeight, product of:
                1.7566941 = boost
                6.091085 = idf(docFreq=271, maxDocs=44218)
                0.011543767 = queryNorm
              0.3806928 = fieldWeight in 2580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.091085 = idf(docFreq=271, maxDocs=44218)
                0.0625 = fieldNorm(doc=2580)
          0.0805579 = weight(abstract_txt:engines in 2580) [ClassicSimilarity], result of:
            0.0805579 = score(doc=2580,freq=3.0), product of:
              0.14036487 = queryWeight, product of:
                2.2935162 = boost
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.011543767 = queryNorm
              0.5739178 = fieldWeight in 2580, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.0625 = fieldNorm(doc=2580)
          0.87609184 = weight(abstract_txt:invisible in 2580) [ClassicSimilarity], result of:
            0.87609184 = score(doc=2580,freq=7.0), product of:
              0.6890003 = queryWeight, product of:
                7.7619476 = boost
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.011543767 = queryNorm
              1.2715405 = fieldWeight in 2580, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.0625 = fieldNorm(doc=2580)
        0.24 = coord(6/25)
    
  2. Lewandowski, D.; Mayr, P.: Exploring the academic invisible Web (2006) 0.25
    0.24903816 = sum of:
      0.24903816 = product of:
        1.037659 = sum of:
          0.006201785 = weight(abstract_txt:this in 3752) [ClassicSimilarity], result of:
            0.006201785 = score(doc=3752,freq=2.0), product of:
              0.02907778 = queryWeight, product of:
                1.0438864 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.011543767 = queryNorm
              0.21328263 = fieldWeight in 3752, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=3752)
          0.06631007 = weight(abstract_txt:uncovering in 3752) [ClassicSimilarity], result of:
            0.06631007 = score(doc=3752,freq=1.0), product of:
              0.123283796 = queryWeight, product of:
                1.2409805 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.011543767 = queryNorm
              0.5378653 = fieldWeight in 3752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.0625 = fieldNorm(doc=3752)
          0.026462492 = weight(abstract_txt:search in 3752) [ClassicSimilarity], result of:
            0.026462492 = score(doc=3752,freq=3.0), product of:
              0.06682518 = queryWeight, product of:
                1.5824962 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.011543767 = queryNorm
              0.3959958 = fieldWeight in 3752, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=3752)
          0.04702329 = weight(abstract_txt:define in 3752) [ClassicSimilarity], result of:
            0.04702329 = score(doc=3752,freq=1.0), product of:
              0.12352029 = queryWeight, product of:
                1.7566941 = boost
                6.091085 = idf(docFreq=271, maxDocs=44218)
                0.011543767 = queryNorm
              0.3806928 = fieldWeight in 3752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.091085 = idf(docFreq=271, maxDocs=44218)
                0.0625 = fieldNorm(doc=3752)
          0.0805579 = weight(abstract_txt:engines in 3752) [ClassicSimilarity], result of:
            0.0805579 = score(doc=3752,freq=3.0), product of:
              0.14036487 = queryWeight, product of:
                2.2935162 = boost
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.011543767 = queryNorm
              0.5739178 = fieldWeight in 3752, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.0625 = fieldNorm(doc=3752)
          0.81110346 = weight(abstract_txt:invisible in 3752) [ClassicSimilarity], result of:
            0.81110346 = score(doc=3752,freq=6.0), product of:
              0.6890003 = queryWeight, product of:
                7.7619476 = boost
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.011543767 = queryNorm
              1.1772178 = fieldWeight in 3752, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.0625 = fieldNorm(doc=3752)
        0.24 = coord(6/25)
    
  3. Sherman, C.; Price, G.: ¬The invisible Web : uncovering information sources search engines can't see (2001) 0.14
    0.1374815 = sum of:
      0.1374815 = product of:
        0.6874075 = sum of:
          0.03469641 = weight(abstract_txt:located in 62) [ClassicSimilarity], result of:
            0.03469641 = score(doc=62,freq=1.0), product of:
              0.08005271 = queryWeight, product of:
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.011543767 = queryNorm
              0.4334196 = fieldWeight in 62, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.0625 = fieldNorm(doc=62)
          0.006201785 = weight(abstract_txt:this in 62) [ClassicSimilarity], result of:
            0.006201785 = score(doc=62,freq=2.0), product of:
              0.02907778 = queryWeight, product of:
                1.0438864 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.011543767 = queryNorm
              0.21328263 = fieldWeight in 62, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=62)
          0.026462492 = weight(abstract_txt:search in 62) [ClassicSimilarity], result of:
            0.026462492 = score(doc=62,freq=3.0), product of:
              0.06682518 = queryWeight, product of:
                1.5824962 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.011543767 = queryNorm
              0.3959958 = fieldWeight in 62, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=62)
          0.046510126 = weight(abstract_txt:engines in 62) [ClassicSimilarity], result of:
            0.046510126 = score(doc=62,freq=1.0), product of:
              0.14036487 = queryWeight, product of:
                2.2935162 = boost
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.011543767 = queryNorm
              0.3313516 = fieldWeight in 62, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.0625 = fieldNorm(doc=62)
          0.5735367 = weight(abstract_txt:invisible in 62) [ClassicSimilarity], result of:
            0.5735367 = score(doc=62,freq=3.0), product of:
              0.6890003 = queryWeight, product of:
                7.7619476 = boost
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.011543767 = queryNorm
              0.8324186 = fieldWeight in 62, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.0625 = fieldNorm(doc=62)
        0.2 = coord(5/25)
    
  4. Borgman, C.L.: ¬The invisible library : paradox of the global information infrastructure (2003) 0.09
    0.09055226 = sum of:
      0.09055226 = product of:
        0.7546022 = sum of:
          0.0065779868 = weight(abstract_txt:this in 1) [ClassicSimilarity], result of:
            0.0065779868 = score(doc=1,freq=1.0), product of:
              0.02907778 = queryWeight, product of:
                1.0438864 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.011543767 = queryNorm
              0.2262204 = fieldWeight in 1, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.09375 = fieldNorm(doc=1)
          0.045588057 = weight(abstract_txt:content in 1) [ClassicSimilarity], result of:
            0.045588057 = score(doc=1,freq=1.0), product of:
              0.11633567 = queryWeight, product of:
                2.4110067 = boost
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.011543767 = queryNorm
              0.39186656 = fieldWeight in 1, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.09375 = fieldNorm(doc=1)
          0.70243615 = weight(abstract_txt:invisible in 1) [ClassicSimilarity], result of:
            0.70243615 = score(doc=1,freq=2.0), product of:
              0.6890003 = queryWeight, product of:
                7.7619476 = boost
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.011543767 = queryNorm
              1.0195005 = fieldWeight in 1, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.09375 = fieldNorm(doc=1)
        0.12 = coord(3/25)
    
  5. Breeding, M.: Thinking about your next OPAC (2007) 0.08
    0.07672779 = sum of:
      0.07672779 = product of:
        0.6393983 = sum of:
          0.16266276 = weight(abstract_txt:that's in 6745) [ClassicSimilarity], result of:
            0.16266276 = score(doc=6745,freq=1.0), product of:
              0.14126192 = queryWeight, product of:
                1.3283867 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.011543767 = queryNorm
              1.1514976 = fieldWeight in 6745, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.125 = fieldNorm(doc=6745)
          0.06078408 = weight(abstract_txt:content in 6745) [ClassicSimilarity], result of:
            0.06078408 = score(doc=6745,freq=1.0), product of:
              0.11633567 = queryWeight, product of:
                2.4110067 = boost
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.011543767 = queryNorm
              0.5224888 = fieldWeight in 6745, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.125 = fieldNorm(doc=6745)
          0.41595143 = weight(abstract_txt:it's in 6745) [ClassicSimilarity], result of:
            0.41595143 = score(doc=6745,freq=1.0), product of:
              0.4193224 = queryWeight, product of:
                4.5773697 = boost
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.011543767 = queryNorm
              0.9919609 = fieldWeight in 6745, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.125 = fieldNorm(doc=6745)
        0.12 = coord(3/25)