Document (#32022)

Author
Sherman, C.
Price, G.
Title
¬The invisible Web : uncovering sources search engines can't see
Source
Library trends. 52(2004) no.2, S.282-298
Year
2004
Abstract
The paradox of the Invisible Web is that it's easy to understand why it exists, but it's very hard to actually define in concrete, specific terms. In a nutshell, the Invisible Web consists of content that's been excluded from general-purpose search engines and Web directories such as Lycos and LookSmart-and yes, even Google. There's nothing inherently "invisible" about this content. But since this content is not easily located with the information-seeking tools used by most Web users, it's effectively invisible because it's so difficult to find unless you know exactly where to look. In this paper, we define the Invisible Web and delve into the reasons search engines can't "see" its content. We also discuss the four different "types" of invisibility, ranging from the "opaque" Web which is relatively accessible to the searcher, to the truly invisible Web, which requires specialized finding aids to access effectively.
Footnote
Beitrag in einem Themenheft: Organizing the Internet
Theme
Internet

Similar documents (author)

  1. Sherman, C.; Price, G.: ¬The invisible Web : uncovering information sources search engines can't see (2001) 6.24
    6.2424884 = sum of:
      6.2424884 = sum of:
        2.6482856 = weight(author_txt:price in 1188) [ClassicSimilarity], result of:
          2.6482856 = score(doc=1188,freq=1.0), product of:
            0.6321239 = queryWeight, product of:
              8.379008 = idf(docFreq=26, maxDocs=43254)
              0.075441375 = queryNorm
            4.189504 = fieldWeight in 1188, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.379008 = idf(docFreq=26, maxDocs=43254)
              0.5 = fieldNorm(doc=1188)
        3.5942028 = weight(author_txt:sherman in 1188) [ClassicSimilarity], result of:
          3.5942028 = score(doc=1188,freq=1.0), product of:
            0.77486736 = queryWeight, product of:
              1.1071656 = boost
              9.27695 = idf(docFreq=10, maxDocs=43254)
              0.075441375 = queryNorm
            4.638475 = fieldWeight in 1188, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.27695 = idf(docFreq=10, maxDocs=43254)
              0.5 = fieldNorm(doc=1188)
    
  2. Sherman, C.R.: ICONCLASS: a historical perspective (1987) 2.25
    2.2463768 = sum of:
      2.2463768 = product of:
        4.4927535 = sum of:
          4.4927535 = weight(author_txt:sherman in 3269) [ClassicSimilarity], result of:
            4.4927535 = score(doc=3269,freq=1.0), product of:
              0.77486736 = queryWeight, product of:
                1.1071656 = boost
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.075441375 = queryNorm
              5.798094 = fieldWeight in 3269, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.625 = fieldNorm(doc=3269)
        0.5 = coord(1/2)
    
  3. Sherman, R.J.: ¬The electronic book (1993) 2.25
    2.2463768 = sum of:
      2.2463768 = product of:
        4.4927535 = sum of:
          4.4927535 = weight(author_txt:sherman in 4568) [ClassicSimilarity], result of:
            4.4927535 = score(doc=4568,freq=1.0), product of:
              0.77486736 = queryWeight, product of:
                1.1071656 = boost
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.075441375 = queryNorm
              5.798094 = fieldWeight in 4568, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.625 = fieldNorm(doc=4568)
        0.5 = coord(1/2)
    
  4. Sherman, C.: What's new with Web search (2000) 2.25
    2.2463768 = sum of:
      2.2463768 = product of:
        4.4927535 = sum of:
          4.4927535 = weight(author_txt:sherman in 6796) [ClassicSimilarity], result of:
            4.4927535 = score(doc=6796,freq=1.0), product of:
              0.77486736 = queryWeight, product of:
                1.1071656 = boost
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.075441375 = queryNorm
              5.798094 = fieldWeight in 6796, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.625 = fieldNorm(doc=6796)
        0.5 = coord(1/2)
    
  5. Sherman, C.: ¬The future of Web search (1999) 2.25
    2.2463768 = sum of:
      2.2463768 = product of:
        4.4927535 = sum of:
          4.4927535 = weight(author_txt:sherman in 6797) [ClassicSimilarity], result of:
            4.4927535 = score(doc=6797,freq=1.0), product of:
              0.77486736 = queryWeight, product of:
                1.1071656 = boost
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.075441375 = queryNorm
              5.798094 = fieldWeight in 6797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.625 = fieldNorm(doc=6797)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Lewandowski, D.; Mayr, P.: Exploring the academic invisible Web (2006) 0.27
    0.2658213 = sum of:
      0.2658213 = product of:
        1.1075888 = sum of:
          0.006298041 = weight(abstract_txt:this in 4581) [ClassicSimilarity], result of:
            0.006298041 = score(doc=4581,freq=2.0), product of:
              0.029305147 = queryWeight, product of:
                1.0539087 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.011436006 = queryNorm
              0.21491244 = fieldWeight in 4581, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=4581)
          0.06751426 = weight(abstract_txt:uncovering in 4581) [ClassicSimilarity], result of:
            0.06751426 = score(doc=4581,freq=1.0), product of:
              0.12446298 = queryWeight, product of:
                1.2539797 = boost
                8.679112 = idf(docFreq=19, maxDocs=43254)
                0.011436006 = queryNorm
              0.5424445 = fieldWeight in 4581, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.679112 = idf(docFreq=19, maxDocs=43254)
                0.0625 = fieldNorm(doc=4581)
          0.026156297 = weight(abstract_txt:search in 4581) [ClassicSimilarity], result of:
            0.026156297 = score(doc=4581,freq=3.0), product of:
              0.06614454 = queryWeight, product of:
                1.583354 = boost
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.011436006 = queryNorm
              0.3954415 = fieldWeight in 4581, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.0625 = fieldNorm(doc=4581)
          0.046767626 = weight(abstract_txt:define in 4581) [ClassicSimilarity], result of:
            0.046767626 = score(doc=4581,freq=1.0), product of:
              0.12276749 = queryWeight, product of:
                1.7612747 = boost
                6.095115 = idf(docFreq=264, maxDocs=43254)
                0.011436006 = queryNorm
              0.3809447 = fieldWeight in 4581, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.095115 = idf(docFreq=264, maxDocs=43254)
                0.0625 = fieldNorm(doc=4581)
          0.079649135 = weight(abstract_txt:engines in 4581) [ClassicSimilarity], result of:
            0.079649135 = score(doc=4581,freq=3.0), product of:
              0.13896237 = queryWeight, product of:
                2.2949839 = boost
                5.2947226 = idf(docFreq=589, maxDocs=43254)
                0.011436006 = queryNorm
              0.57317054 = fieldWeight in 4581, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2947226 = idf(docFreq=589, maxDocs=43254)
                0.0625 = fieldNorm(doc=4581)
          0.8812034 = weight(abstract_txt:invisible in 4581) [ClassicSimilarity], result of:
            0.8812034 = score(doc=4581,freq=7.0), product of:
              0.68996537 = queryWeight, product of:
                7.8114753 = boost
                7.7236013 = idf(docFreq=51, maxDocs=43254)
                0.011436006 = queryNorm
              1.2771705 = fieldWeight in 4581, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.7236013 = idf(docFreq=51, maxDocs=43254)
                0.0625 = fieldNorm(doc=4581)
        0.24 = coord(6/25)
    
  2. Lewandowski, D.; Mayr, P.: Exploring the academic invisible Web (2006) 0.25
    0.25013307 = sum of:
      0.25013307 = product of:
        1.0422212 = sum of:
          0.006298041 = weight(abstract_txt:this in 5753) [ClassicSimilarity], result of:
            0.006298041 = score(doc=5753,freq=2.0), product of:
              0.029305147 = queryWeight, product of:
                1.0539087 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.011436006 = queryNorm
              0.21491244 = fieldWeight in 5753, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=5753)
          0.06751426 = weight(abstract_txt:uncovering in 5753) [ClassicSimilarity], result of:
            0.06751426 = score(doc=5753,freq=1.0), product of:
              0.12446298 = queryWeight, product of:
                1.2539797 = boost
                8.679112 = idf(docFreq=19, maxDocs=43254)
                0.011436006 = queryNorm
              0.5424445 = fieldWeight in 5753, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.679112 = idf(docFreq=19, maxDocs=43254)
                0.0625 = fieldNorm(doc=5753)
          0.026156297 = weight(abstract_txt:search in 5753) [ClassicSimilarity], result of:
            0.026156297 = score(doc=5753,freq=3.0), product of:
              0.06614454 = queryWeight, product of:
                1.583354 = boost
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.011436006 = queryNorm
              0.3954415 = fieldWeight in 5753, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.0625 = fieldNorm(doc=5753)
          0.046767626 = weight(abstract_txt:define in 5753) [ClassicSimilarity], result of:
            0.046767626 = score(doc=5753,freq=1.0), product of:
              0.12276749 = queryWeight, product of:
                1.7612747 = boost
                6.095115 = idf(docFreq=264, maxDocs=43254)
                0.011436006 = queryNorm
              0.3809447 = fieldWeight in 5753, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.095115 = idf(docFreq=264, maxDocs=43254)
                0.0625 = fieldNorm(doc=5753)
          0.079649135 = weight(abstract_txt:engines in 5753) [ClassicSimilarity], result of:
            0.079649135 = score(doc=5753,freq=3.0), product of:
              0.13896237 = queryWeight, product of:
                2.2949839 = boost
                5.2947226 = idf(docFreq=589, maxDocs=43254)
                0.011436006 = queryNorm
              0.57317054 = fieldWeight in 5753, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2947226 = idf(docFreq=589, maxDocs=43254)
                0.0625 = fieldNorm(doc=5753)
          0.81583583 = weight(abstract_txt:invisible in 5753) [ClassicSimilarity], result of:
            0.81583583 = score(doc=5753,freq=6.0), product of:
              0.68996537 = queryWeight, product of:
                7.8114753 = boost
                7.7236013 = idf(docFreq=51, maxDocs=43254)
                0.011436006 = queryNorm
              1.1824301 = fieldWeight in 5753, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.7236013 = idf(docFreq=51, maxDocs=43254)
                0.0625 = fieldNorm(doc=5753)
        0.24 = coord(6/25)
    
  3. Sherman, C.; Price, G.: ¬The invisible Web : uncovering information sources search engines can't see (2001) 0.14
    0.13791241 = sum of:
      0.13791241 = product of:
        0.689562 = sum of:
          0.034239236 = weight(abstract_txt:located in 1188) [ClassicSimilarity], result of:
            0.034239236 = score(doc=1188,freq=1.0), product of:
              0.07915151 = queryWeight, product of:
                6.9212546 = idf(docFreq=115, maxDocs=43254)
                0.011436006 = queryNorm
              0.4325784 = fieldWeight in 1188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9212546 = idf(docFreq=115, maxDocs=43254)
                0.0625 = fieldNorm(doc=1188)
          0.006298041 = weight(abstract_txt:this in 1188) [ClassicSimilarity], result of:
            0.006298041 = score(doc=1188,freq=2.0), product of:
              0.029305147 = queryWeight, product of:
                1.0539087 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.011436006 = queryNorm
              0.21491244 = fieldWeight in 1188, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=1188)
          0.026156297 = weight(abstract_txt:search in 1188) [ClassicSimilarity], result of:
            0.026156297 = score(doc=1188,freq=3.0), product of:
              0.06614454 = queryWeight, product of:
                1.583354 = boost
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.011436006 = queryNorm
              0.3954415 = fieldWeight in 1188, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.0625 = fieldNorm(doc=1188)
          0.04598545 = weight(abstract_txt:engines in 1188) [ClassicSimilarity], result of:
            0.04598545 = score(doc=1188,freq=1.0), product of:
              0.13896237 = queryWeight, product of:
                2.2949839 = boost
                5.2947226 = idf(docFreq=589, maxDocs=43254)
                0.011436006 = queryNorm
              0.33092016 = fieldWeight in 1188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2947226 = idf(docFreq=589, maxDocs=43254)
                0.0625 = fieldNorm(doc=1188)
          0.576883 = weight(abstract_txt:invisible in 1188) [ClassicSimilarity], result of:
            0.576883 = score(doc=1188,freq=3.0), product of:
              0.68996537 = queryWeight, product of:
                7.8114753 = boost
                7.7236013 = idf(docFreq=51, maxDocs=43254)
                0.011436006 = queryNorm
              0.83610433 = fieldWeight in 1188, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.7236013 = idf(docFreq=51, maxDocs=43254)
                0.0625 = fieldNorm(doc=1188)
        0.2 = coord(5/25)
    
  4. Borgman, C.L.: ¬The invisible library : paradox of the global information infrastructure (2003) 0.09
    0.091070324 = sum of:
      0.091070324 = product of:
        0.75891936 = sum of:
          0.0066800816 = weight(abstract_txt:this in 2002) [ClassicSimilarity], result of:
            0.0066800816 = score(doc=2002,freq=1.0), product of:
              0.029305147 = queryWeight, product of:
                1.0539087 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.011436006 = queryNorm
              0.22794908 = fieldWeight in 2002, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.09375 = fieldNorm(doc=2002)
          0.045704648 = weight(abstract_txt:content in 2002) [ClassicSimilarity], result of:
            0.045704648 = score(doc=2002,freq=1.0), product of:
              0.11624544 = queryWeight, product of:
                2.423753 = boost
                4.193853 = idf(docFreq=1773, maxDocs=43254)
                0.011436006 = queryNorm
              0.3931737 = fieldWeight in 2002, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.193853 = idf(docFreq=1773, maxDocs=43254)
                0.09375 = fieldNorm(doc=2002)
          0.7065346 = weight(abstract_txt:invisible in 2002) [ClassicSimilarity], result of:
            0.7065346 = score(doc=2002,freq=2.0), product of:
              0.68996537 = queryWeight, product of:
                7.8114753 = boost
                7.7236013 = idf(docFreq=51, maxDocs=43254)
                0.011436006 = queryNorm
              1.0240146 = fieldWeight in 2002, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7236013 = idf(docFreq=51, maxDocs=43254)
                0.09375 = fieldNorm(doc=2002)
        0.12 = coord(3/25)
    
  5. Breeding, M.: Thinking about your next OPAC (2007) 0.08
    0.07657405 = sum of:
      0.07657405 = product of:
        0.6381171 = sum of:
          0.16030143 = weight(abstract_txt:that's in 1746) [ClassicSimilarity], result of:
            0.16030143 = score(doc=1746,freq=1.0), product of:
              0.13954516 = queryWeight, product of:
                1.327785 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.011436006 = queryNorm
              1.1487423 = fieldWeight in 1746, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.125 = fieldNorm(doc=1746)
          0.060939535 = weight(abstract_txt:content in 1746) [ClassicSimilarity], result of:
            0.060939535 = score(doc=1746,freq=1.0), product of:
              0.11624544 = queryWeight, product of:
                2.423753 = boost
                4.193853 = idf(docFreq=1773, maxDocs=43254)
                0.011436006 = queryNorm
              0.5242316 = fieldWeight in 1746, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.193853 = idf(docFreq=1773, maxDocs=43254)
                0.125 = fieldNorm(doc=1746)
          0.41687614 = weight(abstract_txt:it's in 1746) [ClassicSimilarity], result of:
            0.41687614 = score(doc=1746,freq=1.0), product of:
              0.418904 = queryWeight, product of:
                4.6010575 = boost
                7.9612727 = idf(docFreq=40, maxDocs=43254)
                0.011436006 = queryNorm
              0.9951591 = fieldWeight in 1746, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9612727 = idf(docFreq=40, maxDocs=43254)
                0.125 = fieldNorm(doc=1746)
        0.12 = coord(3/25)