Search (67 results, page 1 of 4)

  • × theme_ss:"Suchmaschinen"
  • × year_i:[1990 TO 2000}
  1. Page, L.; Brin, S.; Motwani, R.; Winograd, T.: The PageRank citation ranking : Bringing order to the Web (1999) 0.08
    0.08028441 = product of:
      0.24085322 = sum of:
        0.24085322 = weight(_text_:citation in 496) [ClassicSimilarity], result of:
          0.24085322 = score(doc=496,freq=4.0), product of:
            0.23479973 = queryWeight, product of:
              4.6892867 = idf(docFreq=1104, maxDocs=44218)
              0.050071523 = queryNorm
            1.0257815 = fieldWeight in 496, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.6892867 = idf(docFreq=1104, maxDocs=44218)
              0.109375 = fieldNorm(doc=496)
      0.33333334 = coord(1/3)
    
    Theme
    Citation indexing
  2. Hancock, B.: Subject-specific search engines : using the Harvest system to gather and maintain information on the Internet (1998) 0.06
    0.0585216 = product of:
      0.1755648 = sum of:
        0.1755648 = sum of:
          0.12807679 = weight(_text_:index in 3238) [ClassicSimilarity], result of:
            0.12807679 = score(doc=3238,freq=6.0), product of:
              0.21880072 = queryWeight, product of:
                4.369764 = idf(docFreq=1520, maxDocs=44218)
                0.050071523 = queryNorm
              0.5853582 = fieldWeight in 3238, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.369764 = idf(docFreq=1520, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3238)
          0.047487997 = weight(_text_:22 in 3238) [ClassicSimilarity], result of:
            0.047487997 = score(doc=3238,freq=2.0), product of:
              0.17534193 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.050071523 = queryNorm
              0.2708308 = fieldWeight in 3238, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3238)
      0.33333334 = coord(1/3)
    
    Abstract
    The increasing expansion of the Internet has made resources available to users in sometimes unmanageable abundance. To help users manage this proliferation of information, librarians have begun to add URLs to their home pages. In addition, specialized search engines are being used to retrieve information from selected sources in an effort to return pertinent results. Describes the Harvest system, which has been used to develop Index Antiquus, a specialized engine for the classics and mediaeval studies. Presents a working example of how to search Index Antiquus
    Date
    6. 3.1997 16:22:15
    Object
    Index Antiquus
  3. Hüskes, R.; Kleber, D.: Den Server im Griff (1999) 0.06
    0.05782532 = product of:
      0.17347595 = sum of:
        0.17347595 = sum of:
          0.10563595 = weight(_text_:index in 4008) [ClassicSimilarity], result of:
            0.10563595 = score(doc=4008,freq=2.0), product of:
              0.21880072 = queryWeight, product of:
                4.369764 = idf(docFreq=1520, maxDocs=44218)
                0.050071523 = queryNorm
              0.48279524 = fieldWeight in 4008, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.369764 = idf(docFreq=1520, maxDocs=44218)
                0.078125 = fieldNorm(doc=4008)
          0.06784 = weight(_text_:22 in 4008) [ClassicSimilarity], result of:
            0.06784 = score(doc=4008,freq=2.0), product of:
              0.17534193 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.050071523 = queryNorm
              0.38690117 = fieldWeight in 4008, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.078125 = fieldNorm(doc=4008)
      0.33333334 = coord(1/3)
    
    Date
    22. 8.1999 21:21:10
    Object
    Microsoft Index Server
  4. Ardo, A.; Lundberg, S.: A regional distributed WWW search and indexing service : the DESIRE way (1998) 0.04
    0.04344636 = product of:
      0.13033907 = sum of:
        0.13033907 = sum of:
          0.089635074 = weight(_text_:index in 4190) [ClassicSimilarity], result of:
            0.089635074 = score(doc=4190,freq=4.0), product of:
              0.21880072 = queryWeight, product of:
                4.369764 = idf(docFreq=1520, maxDocs=44218)
                0.050071523 = queryNorm
              0.40966535 = fieldWeight in 4190, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.369764 = idf(docFreq=1520, maxDocs=44218)
                0.046875 = fieldNorm(doc=4190)
          0.040703997 = weight(_text_:22 in 4190) [ClassicSimilarity], result of:
            0.040703997 = score(doc=4190,freq=2.0), product of:
              0.17534193 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.050071523 = queryNorm
              0.23214069 = fieldWeight in 4190, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=4190)
      0.33333334 = coord(1/3)
    
    Abstract
    Creates an open, metadata-aware system for distributed, collaborative WWW indexing. The system has 3 main components: a harvester (for collecting information), a database (for making the collection searchable), and a user interface (for making the information available). All components can be distributed across networked computers, thus supporting scalability. The system is metadata-aware and thus allows searches on several fields including title, document author and URL. Nordic Web Index (NWI) is an application using this system to create a regional Nordic Web-indexing service. NWI is built using 5 collaborating service points within the Nordic countries. The NWI databases can be used to build additional services
    Date
    1. 8.1996 22:08:06
    Object
    Nordic Web Index
  5. Duval, B.K.; Main, L.: Searching the Internet : part 2 trail-blazers (1997) 0.03
    0.03469519 = product of:
      0.104085565 = sum of:
        0.104085565 = sum of:
          0.06338157 = weight(_text_:index in 858) [ClassicSimilarity], result of:
            0.06338157 = score(doc=858,freq=2.0), product of:
              0.21880072 = queryWeight, product of:
                4.369764 = idf(docFreq=1520, maxDocs=44218)
                0.050071523 = queryNorm
              0.28967714 = fieldWeight in 858, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.369764 = idf(docFreq=1520, maxDocs=44218)
                0.046875 = fieldNorm(doc=858)
          0.040703997 = weight(_text_:22 in 858) [ClassicSimilarity], result of:
            0.040703997 = score(doc=858,freq=2.0), product of:
              0.17534193 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.050071523 = queryNorm
              0.23214069 = fieldWeight in 858, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=858)
      0.33333334 = coord(1/3)
    
    Abstract
    Presents a guide to searching for information on the Internet covering Research-It; Familiar quotations: a collection of passages, phrases and proverbs traced to their sources in ancient and modern literature by John Bartlett; the Internet Public Library Reference Center; SearchERIC Database; Britannica Online; Britannica's Lives; The complete works of William Shakespeare; Flicks/Movie Schedules and Reviews; the Electronic Newsstand; CNN Interactive; Time Warner's Pathfinder; Electronic Newspapers from all 50 States; Yahoo, News; Newspapers; Techweb; ZDNet; the On-line Books Page; Columbia University Bartleby Library; the Children's Literature Web Guide; National Institutes of Health; US Census Bureau; Earthquake Info; US Postal Service Zip+4 Lookup; the Federal Web Locator; World Wide Web Virtual Library; US Government Information Sources; Index of the Constitution of the US; US States Code; Find California Code; Search for Bills; California Tenant's Rights; The Online Career Center; QuickAID Home Page; City.Net; Netscape's Destinations Button; International Telephone Directory; World Alumni Net; Archives of Adoptees and Birth Parents; and World Wide Registry Matching Adoptees with Birth Parents
    Date
    6. 3.1997 16:22:15
  6. Sieverts, E.: Citatie-zoeken op het Web (1997) 0.03
    0.032439798 = product of:
      0.097319394 = sum of:
        0.097319394 = weight(_text_:citation in 143) [ClassicSimilarity], result of:
          0.097319394 = score(doc=143,freq=2.0), product of:
            0.23479973 = queryWeight, product of:
              4.6892867 = idf(docFreq=1104, maxDocs=44218)
              0.050071523 = queryNorm
            0.4144783 = fieldWeight in 143, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.6892867 = idf(docFreq=1104, maxDocs=44218)
              0.0625 = fieldNorm(doc=143)
      0.33333334 = coord(1/3)
    
    Footnote
    Translation of title: Citation searching on the Web
  7. Leighton, H.V.: Performance of four World Wide Web (WWW) index services : Infoseek, Lycos, WebCrawler and WWWWorm (1995) 0.02
    0.02112719 = product of:
      0.06338157 = sum of:
        0.06338157 = product of:
          0.12676314 = sum of:
            0.12676314 = weight(_text_:index in 3168) [ClassicSimilarity], result of:
              0.12676314 = score(doc=3168,freq=2.0), product of:
                0.21880072 = queryWeight, product of:
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.050071523 = queryNorm
                0.5793543 = fieldWeight in 3168, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.09375 = fieldNorm(doc=3168)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
  8. Koch, T.: Searching the Web : systematic overview over indexes (1995) 0.02
    0.02112719 = product of:
      0.06338157 = sum of:
        0.06338157 = product of:
          0.12676314 = sum of:
            0.12676314 = weight(_text_:index in 3169) [ClassicSimilarity], result of:
              0.12676314 = score(doc=3169,freq=2.0), product of:
                0.21880072 = queryWeight, product of:
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.050071523 = queryNorm
                0.5793543 = fieldWeight in 3169, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.09375 = fieldNorm(doc=3169)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Object
    Nordic Web Index
  9. Collier, H.: Needles in electronic haystacks (1996) 0.02
    0.02112719 = product of:
      0.06338157 = sum of:
        0.06338157 = product of:
          0.12676314 = sum of:
            0.12676314 = weight(_text_:index in 5791) [ClassicSimilarity], result of:
              0.12676314 = score(doc=5791,freq=2.0), product of:
                0.21880072 = queryWeight, product of:
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.050071523 = queryNorm
                0.5793543 = fieldWeight in 5791, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.09375 = fieldNorm(doc=5791)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Briefly comments on the main features of the HotBot WWW search engine, which claims to index 53 million web pages, and uses the Inktomi search engine on an interface designed by HotWired
  10. Ardö, A.; Koch, T.: Automatic classification applied to full-text Internet documents in a robot-generated subject index (1999) 0.02
    0.02112719 = product of:
      0.06338157 = sum of:
        0.06338157 = product of:
          0.12676314 = sum of:
            0.12676314 = weight(_text_:index in 382) [ClassicSimilarity], result of:
              0.12676314 = score(doc=382,freq=2.0), product of:
                0.21880072 = queryWeight, product of:
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.050071523 = queryNorm
                0.5793543 = fieldWeight in 382, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.09375 = fieldNorm(doc=382)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
  11. Herring, S.D.: The value of interdisciplinarity : a study based on the design of Internet search engines (1999) 0.02
    0.020274874 = product of:
      0.06082462 = sum of:
        0.06082462 = weight(_text_:citation in 3458) [ClassicSimilarity], result of:
          0.06082462 = score(doc=3458,freq=2.0), product of:
            0.23479973 = queryWeight, product of:
              4.6892867 = idf(docFreq=1104, maxDocs=44218)
              0.050071523 = queryNorm
            0.25904894 = fieldWeight in 3458, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.6892867 = idf(docFreq=1104, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3458)
      0.33333334 = coord(1/3)
    
    Abstract
    Continued development of the Internet requires the development of efficient, easy-to-use search engines. Ideally, such development should call upon knowledge and skills from a variety of disciplines, including computer science, information science, psychology, and ergonomics. The current study is intended to determine whether search engine research shows a pattern of interdisciplinarity. 2 disciplines were selected as the focus for the study: computer science, and library/information science. A citation analysis was conducted to measure levels of interdisciplinary research and publishing in Internet search engine design and development. The results show a higher level of interdisciplinarity among library and information scientists than among computer scientists or among any of those categorized as 'other'. This is reflected both in the types of journals in which the authors publish, and in the references they cite to support their work. However, almost no authors published articles or cited references in fields such as cognitive science, ergonomics, or psychology. The results of this study are analyzed in terms of the writings of Patrick Wilson, Bruno Latour, Pierre Bourdieu, Fritz Ringer, and Thomas Pinelli, focusing on cognitive authority within a profession, interaction between disciplines, and information-gathering habits of professionals. Suggestions for further research are given
  12. Van der Walt, M.: The structure of classification schemes used in Internet search engines (1998) 0.02
    0.020274874 = product of:
      0.06082462 = sum of:
        0.06082462 = weight(_text_:citation in 84) [ClassicSimilarity], result of:
          0.06082462 = score(doc=84,freq=2.0), product of:
            0.23479973 = queryWeight, product of:
              4.6892867 = idf(docFreq=1104, maxDocs=44218)
              0.050071523 = queryNorm
            0.25904894 = fieldWeight in 84, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.6892867 = idf(docFreq=1104, maxDocs=44218)
              0.0390625 = fieldNorm(doc=84)
      0.33333334 = coord(1/3)
    
    Abstract
    The purpose of this paper is to determine some of the structural features of the classification schemes used in the directories (guides, channels) of search engines to organise information sources on the Internet. Ten search engines were examined at the main class level and the full hierarchies of a sample of three specific subjects were analysed in four of these engines, namely Excite, Infoseek, Lycos and Yahoo! It was found that there are major differences between the main classes of the search engines and those found in standard library schemes like Dewey, UDC and LCC. There are large gaps in subject coverage at main class level in the search engines and the general tendency is to use a topic-based approach in the formation of classes, rather than a discipline-based approach. The subdivision of the main classes is according to hierarchical tree structures, but a number of anomalies in this regard were identified. Another deviation from library classification theory is that various principles of division are employed to form classes at the same hierarchical level. In an analysis of citation orders many examples were found that conform to the principles followed in library classifications, but a number of inconsistencies in this regard were also noted
  13. Raeder, A.: Cataloguing the Web (1995) 0.02
    0.019918907 = product of:
      0.05975672 = sum of:
        0.05975672 = product of:
          0.11951344 = sum of:
            0.11951344 = weight(_text_:index in 3387) [ClassicSimilarity], result of:
              0.11951344 = score(doc=3387,freq=4.0), product of:
                0.21880072 = queryWeight, product of:
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.050071523 = queryNorm
                0.5462205 = fieldWeight in 3387, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3387)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Lists and describes sites that attempt to aid Internet searchers by helping them locate sites, files or information. Gives an overview of the methods used. Covers the following sites: Aliweb, ArchiPlex Archie Gateway, CUI W3, Clearing House for Subject Oriented Internet Resource Guide, InfoSeek, JumpStation, Lawrence Livermore National Laboratories List of Lists, Lycos WWW Search Engine, Mother of all BBSs, NIKOS, Planet Earth Home Page, Stanford Netnews Filtering Service, WWW Home Page Harvest Browser, WWW Virtual Library, WWW Wanderer Index, WWW Worm, Web Crawler, Whole Internet Catalog, and Yahoo Index to the Internet
  14. Kimmel, S.: Robot-generated databases on the World Wide Web (1996) 0.02
    0.019918907 = product of:
      0.05975672 = sum of:
        0.05975672 = product of:
          0.11951344 = sum of:
            0.11951344 = weight(_text_:index in 4724) [ClassicSimilarity], result of:
              0.11951344 = score(doc=4724,freq=4.0), product of:
                0.21880072 = queryWeight, product of:
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.050071523 = queryNorm
                0.5462205 = fieldWeight in 4724, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4724)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    WWW robots are programs that attempt to gather and index WWW resources. They reside on a host computer and retrieve information from sites on the WWW using standard protocols. Gives an overview of robots and robot-generated databases. Covers: WWW Worm; Lycos; WebCrawler; AliWeb; Harvest; Jumpstation II; and Open Text Index. Also discusses Yahoo and Trade Wave, which are comparable tools for resource discovery
  15. Moody, G.: Searching the Web for gigabucks (1996) 0.02
    0.019918907 = product of:
      0.05975672 = sum of:
        0.05975672 = product of:
          0.11951344 = sum of:
            0.11951344 = weight(_text_:index in 5603) [ClassicSimilarity], result of:
              0.11951344 = score(doc=5603,freq=4.0), product of:
                0.21880072 = queryWeight, product of:
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.050071523 = queryNorm
                0.5462205 = fieldWeight in 5603, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.0625 = fieldNorm(doc=5603)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Following the success of Netscape, the software used to search the WWW, predicts that the next generation of web search engines will emerge from the ranks of a new class of Internet company. Discusses the challenges facing search engines in indexing the vast and growing quantities of data available on the WWW, and the use of spiders, crawlers and worms. Notes that, while AltaVista can index 2.5 million web pages a day, a revised Inktomi search engine will be able to index 10 million pages per day
  16. Großjohann, K.: Gathering-, Harvesting-, Suchmaschinen (1996) 0.02
    0.01918805 = product of:
      0.05756415 = sum of:
        0.05756415 = product of:
          0.1151283 = sum of:
            0.1151283 = weight(_text_:22 in 3227) [ClassicSimilarity], result of:
              0.1151283 = score(doc=3227,freq=4.0), product of:
                0.17534193 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050071523 = queryNorm
                0.6565931 = fieldWeight in 3227, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=3227)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    7. 2.1996 22:38:41
    Pages
    22 S
  17. Höfer, W.: Detektive im Web (1999) 0.02
    0.01918805 = product of:
      0.05756415 = sum of:
        0.05756415 = product of:
          0.1151283 = sum of:
            0.1151283 = weight(_text_:22 in 4007) [ClassicSimilarity], result of:
              0.1151283 = score(doc=4007,freq=4.0), product of:
                0.17534193 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050071523 = queryNorm
                0.6565931 = fieldWeight in 4007, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=4007)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    22. 8.1999 20:22:06
  18. Rensman, J.: Blick ins Getriebe (1999) 0.02
    0.01918805 = product of:
      0.05756415 = sum of:
        0.05756415 = product of:
          0.1151283 = sum of:
            0.1151283 = weight(_text_:22 in 4009) [ClassicSimilarity], result of:
              0.1151283 = score(doc=4009,freq=4.0), product of:
                0.17534193 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050071523 = queryNorm
                0.6565931 = fieldWeight in 4009, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=4009)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    22. 8.1999 21:22:59
  19. Lawrence, S.; Giles, C.L.: Accessibility and distribution of information on the Web (1999) 0.02
    0.018296685 = product of:
      0.05489005 = sum of:
        0.05489005 = product of:
          0.1097801 = sum of:
            0.1097801 = weight(_text_:index in 4952) [ClassicSimilarity], result of:
              0.1097801 = score(doc=4952,freq=6.0), product of:
                0.21880072 = queryWeight, product of:
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.050071523 = queryNorm
                0.50173557 = fieldWeight in 4952, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4952)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Search engine coverage relative to the estimated size of the publicly indexable web has decreased substantially since December 1997, with no engine indexing more than about 16% of the estimated size of the publicly indexable web. (Note that many queries can be satisfied with a relatively small database.) Search engines are typically more likely to index sites that have more links to them (more 'popular' sites). They are also typically more likely to index US sites than non-US sites (AltaVista is an exception), and more likely to index commercial sites than educational sites. Indexing of new or modified pages by just one of the major search engines can take months. 83% of sites contain commercial content and 6% contain scientific or educational content. Only 1.5% of sites contain pornographic content. The publicly indexable web contains an estimated 800 million pages as of February 1999, encompassing about 15 terabytes of information or about 6 terabytes of text after removing HTML tags, comments, and extra whitespace. The simple HTML "keywords" and "description" metatags are only used on the homepages of 34% of sites. Only 0.3% of sites use the Dublin Core metadata standard.
  20. Matrix of WWW indices : a comparison of Internet indexing tools (1995) 0.02
    0.017605992 = product of:
      0.052817974 = sum of:
        0.052817974 = product of:
          0.10563595 = sum of:
            0.10563595 = weight(_text_:index in 3165) [ClassicSimilarity], result of:
              0.10563595 = score(doc=3165,freq=2.0), product of:
                0.21880072 = queryWeight, product of:
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.050071523 = queryNorm
                0.48279524 = fieldWeight in 3165, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.369764 = idf(docFreq=1520, maxDocs=44218)
                  0.078125 = fieldNorm(doc=3165)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Object
    GNA Meta-Index
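
The score breakdowns in the result list above follow the TF-IDF arithmetic of Lucene's ClassicSimilarity. The following is a minimal Python sketch, not the search engine's own code, that recomputes three of the listed scores from the numbers shown in the trees. The function and variable names are illustrative assumptions. The formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are those of ClassicSimilarity and agree with the listed values up to float rounding.

```python
import math

# Constants copied from the explain trees in the result list.
MAX_DOCS = 44218
QUERY_NORM = 0.050071523


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, field_norm: float) -> float:
    """Score of one term in one document: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)                      # tf(freq)
    idf = classic_idf(doc_freq, MAX_DOCS)     # idf(docFreq, maxDocs)
    query_weight = idf * QUERY_NORM           # queryWeight
    field_weight = tf * idf * field_norm      # fieldWeight
    return query_weight * field_weight


# Result 1 (doc 496): one matching term "citation", then coord(1/3).
r1 = term_score(4.0, 1104, 0.109375) * (1 / 3)

# Result 2 (doc 3238): terms "index" and "22" are summed, then coord(1/3).
r2 = (term_score(6.0, 1520, 0.0546875)
      + term_score(2.0, 3622, 0.0546875)) * (1 / 3)

# Result 7 (doc 3168): term "index" with coord(1/2), then coord(1/3).
r7 = term_score(2.0, 1520, 0.09375) * 0.5 * (1 / 3)

# Compare with the totals listed for results 1, 2 and 7:
# 0.08028441, 0.0585216 and 0.02112719.
for label, value in (("result 1", r1), ("result 2", r2), ("result 7", r7)):
    print(f"{label}: {value:.6f}")
```

The coord factors explain why documents matching only one of three query clauses score so low: each total is scaled by the fraction of clauses matched, and results 7 to 10 are scaled a second time by coord(1/2) inside a nested clause.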

Languages

  • e 47
  • d 17
  • nl 2
  • f 1

Types

  • a 57
  • el 9
  • p 2
  • m 1