Document (#25383)

Author
Ardö, A.
Koch, T.
Title
Automatic classification applied to full-text Internet documents in a robot-generated subject index
Source
Online information 99: 23rd International Online Information Meeting, Proceedings, London, 7-9 December 1999. Ed.: D. Raitt et al.
Imprint
Hinksey Hill : Learned Information
Year
1999
Pages
pp. 239-246
Theme
Automatic classification
Search engines

Similar documents (author)

  1. Ardö, A.; Koch, T.: Lunds Universitets Elektroniska Bibliotek : Del.2: Gopher, World Wide Web (WWW). Planerade projekt (1993) 5.91
    5.9051695 = sum of:
      5.9051695 = sum of:
        1.974901 = weight(author_txt:koch in 6001) [ClassicSimilarity], result of:
          1.974901 = score(doc=6001,freq=1.0), product of:
            0.53427523 = queryWeight, product of:
              7.3928223 = idf(docFreq=73, maxDocs=44218)
              0.072269455 = queryNorm
            3.6964111 = fieldWeight in 6001, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.3928223 = idf(docFreq=73, maxDocs=44218)
              0.5 = fieldNorm(doc=6001)
        3.9302683 = weight(author_txt:ardö in 6001) [ClassicSimilarity], result of:
          3.9302683 = score(doc=6001,freq=1.0), product of:
            0.84531057 = queryWeight, product of:
              1.2578406 = boost
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.072269455 = queryNorm
            4.649496 = fieldWeight in 6001, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.5 = fieldNorm(doc=6001)
    
  2. Ardö, A.; Koch, T.: Wide-area information server (WAIS) as the hub of an electronic library service at Lund University (1993) 5.91
    5.9051695 = sum of:
      5.9051695 = sum of:
        1.974901 = weight(author_txt:koch in 8459) [ClassicSimilarity], result of:
          1.974901 = score(doc=8459,freq=1.0), product of:
            0.53427523 = queryWeight, product of:
              7.3928223 = idf(docFreq=73, maxDocs=44218)
              0.072269455 = queryNorm
            3.6964111 = fieldWeight in 8459, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.3928223 = idf(docFreq=73, maxDocs=44218)
              0.5 = fieldNorm(doc=8459)
        3.9302683 = weight(author_txt:ardö in 8459) [ClassicSimilarity], result of:
          3.9302683 = score(doc=8459,freq=1.0), product of:
            0.84531057 = queryWeight, product of:
              1.2578406 = boost
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.072269455 = queryNorm
            4.649496 = fieldWeight in 8459, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.5 = fieldNorm(doc=8459)
    
  3. Koch, T.; Ardö, A.: Automatic classification of full-text HTML-documents from one specific subject area : DESIRE II D3.6a, Working Paper 2 (2000) 5.91
    5.9051695 = sum of:
      5.9051695 = sum of:
        1.974901 = weight(author_txt:koch in 1667) [ClassicSimilarity], result of:
          1.974901 = score(doc=1667,freq=1.0), product of:
            0.53427523 = queryWeight, product of:
              7.3928223 = idf(docFreq=73, maxDocs=44218)
              0.072269455 = queryNorm
            3.6964111 = fieldWeight in 1667, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.3928223 = idf(docFreq=73, maxDocs=44218)
              0.5 = fieldNorm(doc=1667)
        3.9302683 = weight(author_txt:ardö in 1667) [ClassicSimilarity], result of:
          3.9302683 = score(doc=1667,freq=1.0), product of:
            0.84531057 = queryWeight, product of:
              1.2578406 = boost
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.072269455 = queryNorm
            4.649496 = fieldWeight in 1667, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.5 = fieldNorm(doc=1667)
    
  4. Koch, T.; Ardö, A.; Noodén, L.: ¬The construction of a robot-generated subject index : DESIRE II D3.6a, Working Paper 1 (1999) 4.43
    4.428877 = sum of:
      4.428877 = sum of:
        1.4811757 = weight(author_txt:koch in 1668) [ClassicSimilarity], result of:
          1.4811757 = score(doc=1668,freq=1.0), product of:
            0.53427523 = queryWeight, product of:
              7.3928223 = idf(docFreq=73, maxDocs=44218)
              0.072269455 = queryNorm
            2.7723083 = fieldWeight in 1668, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.3928223 = idf(docFreq=73, maxDocs=44218)
              0.375 = fieldNorm(doc=1668)
        2.9477012 = weight(author_txt:ardö in 1668) [ClassicSimilarity], result of:
          2.9477012 = score(doc=1668,freq=1.0), product of:
            0.84531057 = queryWeight, product of:
              1.2578406 = boost
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.072269455 = queryNorm
            3.487122 = fieldWeight in 1668, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.375 = fieldNorm(doc=1668)
    
  5. Koch, T.; Ardö, A.; Brümmer, A.: ¬The building and maintenance of robot based internet search services : A review of current indexing and data collection methods. Prepared to meet the requirements of Work Package 3 of EU Telematics for Research, project DESIRE. Version D3.11v0.3 (Draft version 3) (1996) 4.43
    4.428877 = sum of:
      4.428877 = sum of:
        1.4811757 = weight(author_txt:koch in 1669) [ClassicSimilarity], result of:
          1.4811757 = score(doc=1669,freq=1.0), product of:
            0.53427523 = queryWeight, product of:
              7.3928223 = idf(docFreq=73, maxDocs=44218)
              0.072269455 = queryNorm
            2.7723083 = fieldWeight in 1669, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.3928223 = idf(docFreq=73, maxDocs=44218)
              0.375 = fieldNorm(doc=1669)
        2.9477012 = weight(author_txt:ardö in 1669) [ClassicSimilarity], result of:
          2.9477012 = score(doc=1669,freq=1.0), product of:
            0.84531057 = queryWeight, product of:
              1.2578406 = boost
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.072269455 = queryNorm
            3.487122 = fieldWeight in 1669, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.375 = fieldNorm(doc=1669)
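
The author-match scores above can be recomputed from the factors shown in each explain tree. The following is a minimal sketch, assuming tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which is consistent with the listed values; the function names are illustrative and are not part of the catalog software.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency, assumed form: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq: float) -> float:
    # Term-frequency factor, assumed form: square root of the raw frequency.
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, field_norm, query_norm, boost=1.0):
    # score = queryWeight * fieldWeight, as in the explain trees.
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


# Values copied from the explain output for doc 6001 (author_txt field).
MAX_DOCS = 44218
QUERY_NORM = 0.072269455

koch = term_score(1.0, 73, MAX_DOCS, field_norm=0.5, query_norm=QUERY_NORM)
ardo = term_score(1.0, 10, MAX_DOCS, field_norm=0.5, query_norm=QUERY_NORM,
                  boost=1.2578406)
total = koch + ardo

print(f"koch={koch:.6f} ardö={ardo:.6f} total={total:.6f}")
```

Only fieldNorm differs between entries 1 to 3 (0.5) and entries 4 to 5 (0.375), which is why the three-author records score lower with otherwise identical factors.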
    

Similar documents (content)

  1. Lindholm, J.; Schönthal, T.; Jansson, K.: Experiences of harvesting Web resources in engineering using automatic classification (2003) 0.83
    0.83421975 = sum of:
      0.83421975 = product of:
        1.5294029 = sum of:
          0.07409634 = weight(abstract_txt:subject in 4088) [ClassicSimilarity], result of:
            0.07409634 = score(doc=4088,freq=1.0), product of:
              0.15171944 = queryWeight, product of:
                1.0482496 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.03704512 = queryNorm
              0.48837733 = fieldWeight in 4088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.125 = fieldNorm(doc=4088)
          0.07904172 = weight(abstract_txt:classification in 4088) [ClassicSimilarity], result of:
            0.07904172 = score(doc=4088,freq=1.0), product of:
              0.15839726 = queryWeight, product of:
                1.0710702 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.03704512 = queryNorm
              0.4990094 = fieldWeight in 4088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.125 = fieldNorm(doc=4088)
          0.18817765 = weight(abstract_txt:index in 4088) [ClassicSimilarity], result of:
            0.18817765 = score(doc=4088,freq=2.0), product of:
              0.22415346 = queryWeight, product of:
                1.274139 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.03704512 = queryNorm
              0.83950365 = fieldWeight in 4088, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.125 = fieldNorm(doc=4088)
          0.1742476 = weight(abstract_txt:automatic in 4088) [ClassicSimilarity], result of:
            0.1742476 = score(doc=4088,freq=1.0), product of:
              0.26830035 = queryWeight, product of:
                1.3939742 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.03704512 = queryNorm
              0.6494497 = fieldWeight in 4088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.125 = fieldNorm(doc=4088)
          0.20908275 = weight(abstract_txt:generated in 4088) [ClassicSimilarity], result of:
            0.20908275 = score(doc=4088,freq=1.0), product of:
              0.3029625 = queryWeight, product of:
                1.4812847 = boost
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.03704512 = queryNorm
              0.6901275 = fieldWeight in 4088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.125 = fieldNorm(doc=4088)
          0.8047569 = weight(abstract_txt:robot in 4088) [ClassicSimilarity], result of:
            0.8047569 = score(doc=4088,freq=1.0), product of:
              0.7440804 = queryWeight, product of:
                2.3214216 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.03704512 = queryNorm
              1.0815456 = fieldWeight in 4088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.125 = fieldNorm(doc=4088)
        0.54545456 = coord(6/11)
    
  2. Ardö, A.; Godby, J.; Houghton, A.; Koch, T.; Reighart, R.; Thompson, R.; Vizine-Goetz, D.: Browsing engineering resources on the Web : a general knowledge organization scheme (Dewey) vs. a special scheme (EI) (2000) 0.82
    0.8162595 = sum of:
      0.8162595 = product of:
        1.2826935 = sum of:
          0.05557225 = weight(abstract_txt:subject in 86) [ClassicSimilarity], result of:
            0.05557225 = score(doc=86,freq=1.0), product of:
              0.15171944 = queryWeight, product of:
                1.0482496 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.03704512 = queryNorm
              0.366283 = fieldWeight in 86, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.09375 = fieldNorm(doc=86)
          0.10267821 = weight(abstract_txt:classification in 86) [ClassicSimilarity], result of:
            0.10267821 = score(doc=86,freq=3.0), product of:
              0.15839726 = queryWeight, product of:
                1.0710702 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.03704512 = queryNorm
              0.6482322 = fieldWeight in 86, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.09375 = fieldNorm(doc=86)
          0.0922444 = weight(abstract_txt:documents in 86) [ClassicSimilarity], result of:
            0.0922444 = score(doc=86,freq=2.0), product of:
              0.16881819 = queryWeight, product of:
                1.1057417 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.03704512 = queryNorm
              0.5464127 = fieldWeight in 86, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.09375 = fieldNorm(doc=86)
          0.14113323 = weight(abstract_txt:index in 86) [ClassicSimilarity], result of:
            0.14113323 = score(doc=86,freq=2.0), product of:
              0.22415346 = queryWeight, product of:
                1.274139 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.03704512 = queryNorm
              0.6296277 = fieldWeight in 86, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.09375 = fieldNorm(doc=86)
          0.13068569 = weight(abstract_txt:automatic in 86) [ClassicSimilarity], result of:
            0.13068569 = score(doc=86,freq=1.0), product of:
              0.26830035 = queryWeight, product of:
                1.3939742 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.03704512 = queryNorm
              0.48708728 = fieldWeight in 86, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.09375 = fieldNorm(doc=86)
          0.15681207 = weight(abstract_txt:generated in 86) [ClassicSimilarity], result of:
            0.15681207 = score(doc=86,freq=1.0), product of:
              0.3029625 = queryWeight, product of:
                1.4812847 = boost
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.03704512 = queryNorm
              0.51759565 = fieldWeight in 86, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.09375 = fieldNorm(doc=86)
          0.60356766 = weight(abstract_txt:robot in 86) [ClassicSimilarity], result of:
            0.60356766 = score(doc=86,freq=1.0), product of:
              0.7440804 = queryWeight, product of:
                2.3214216 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.03704512 = queryNorm
              0.8111592 = fieldWeight in 86, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.09375 = fieldNorm(doc=86)
        0.6363636 = coord(7/11)
    
  3. Koch, T.; Ardö, A.; Noodén, L.: ¬The construction of a robot-generated subject index : DESIRE II D3.6a, Working Paper 1 (1999) 0.61
    0.61280423 = sum of:
      0.61280423 = product of:
        1.1234744 = sum of:
          0.04631021 = weight(abstract_txt:subject in 1668) [ClassicSimilarity], result of:
            0.04631021 = score(doc=1668,freq=1.0), product of:
              0.15171944 = queryWeight, product of:
                1.0482496 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.03704512 = queryNorm
              0.30523583 = fieldWeight in 1668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.078125 = fieldNorm(doc=1668)
          0.04940108 = weight(abstract_txt:classification in 1668) [ClassicSimilarity], result of:
            0.04940108 = score(doc=1668,freq=1.0), product of:
              0.15839726 = queryWeight, product of:
                1.0710702 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.03704512 = queryNorm
              0.3118809 = fieldWeight in 1668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=1668)
          0.07687034 = weight(abstract_txt:documents in 1668) [ClassicSimilarity], result of:
            0.07687034 = score(doc=1668,freq=2.0), product of:
              0.16881819 = queryWeight, product of:
                1.1057417 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.03704512 = queryNorm
              0.4553439 = fieldWeight in 1668, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=1668)
          0.10890475 = weight(abstract_txt:automatic in 1668) [ClassicSimilarity], result of:
            0.10890475 = score(doc=1668,freq=1.0), product of:
              0.26830035 = queryWeight, product of:
                1.3939742 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.03704512 = queryNorm
              0.40590608 = fieldWeight in 1668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.078125 = fieldNorm(doc=1668)
          0.13067672 = weight(abstract_txt:generated in 1668) [ClassicSimilarity], result of:
            0.13067672 = score(doc=1668,freq=1.0), product of:
              0.3029625 = queryWeight, product of:
                1.4812847 = boost
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.03704512 = queryNorm
              0.43132967 = fieldWeight in 1668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.078125 = fieldNorm(doc=1668)
          0.7113113 = weight(abstract_txt:robot in 1668) [ClassicSimilarity], result of:
            0.7113113 = score(doc=1668,freq=2.0), product of:
              0.7440804 = queryWeight, product of:
                2.3214216 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.03704512 = queryNorm
              0.9559602 = fieldWeight in 1668, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.078125 = fieldNorm(doc=1668)
        0.54545456 = coord(6/11)
    
  4. Kimmel, S.: Robot-generated databases on the World Wide Web (1996) 0.47
    0.4669729 = sum of:
      0.4669729 = product of:
        1.2841754 = sum of:
          0.08215816 = weight(abstract_txt:text in 4724) [ClassicSimilarity], result of:
            0.08215816 = score(doc=4724,freq=1.0), product of:
              0.16253388 = queryWeight, product of:
                1.0849658 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.03704512 = queryNorm
              0.5054833 = fieldWeight in 4724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.125 = fieldNorm(doc=4724)
          0.18817765 = weight(abstract_txt:index in 4724) [ClassicSimilarity], result of:
            0.18817765 = score(doc=4724,freq=2.0), product of:
              0.22415346 = queryWeight, product of:
                1.274139 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.03704512 = queryNorm
              0.83950365 = fieldWeight in 4724, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.125 = fieldNorm(doc=4724)
          0.20908275 = weight(abstract_txt:generated in 4724) [ClassicSimilarity], result of:
            0.20908275 = score(doc=4724,freq=1.0), product of:
              0.3029625 = queryWeight, product of:
                1.4812847 = boost
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.03704512 = queryNorm
              0.6901275 = fieldWeight in 4724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.125 = fieldNorm(doc=4724)
          0.8047569 = weight(abstract_txt:robot in 4724) [ClassicSimilarity], result of:
            0.8047569 = score(doc=4724,freq=1.0), product of:
              0.7440804 = queryWeight, product of:
                2.3214216 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.03704512 = queryNorm
              1.0815456 = fieldWeight in 4724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.125 = fieldNorm(doc=4724)
        0.36363637 = coord(4/11)
    
  5. Kimmel, S.: WWW search tools in reference services (1997) 0.45
    0.44506472 = sum of:
      0.44506472 = product of:
        1.6319039 = sum of:
          0.1111445 = weight(abstract_txt:subject in 619) [ClassicSimilarity], result of:
            0.1111445 = score(doc=619,freq=1.0), product of:
              0.15171944 = queryWeight, product of:
                1.0482496 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.03704512 = queryNorm
              0.732566 = fieldWeight in 619, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.1875 = fieldNorm(doc=619)
          0.31362414 = weight(abstract_txt:generated in 619) [ClassicSimilarity], result of:
            0.31362414 = score(doc=619,freq=1.0), product of:
              0.3029625 = queryWeight, product of:
                1.4812847 = boost
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.03704512 = queryNorm
              1.0351913 = fieldWeight in 619, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.1875 = fieldNorm(doc=619)
          1.2071353 = weight(abstract_txt:robot in 619) [ClassicSimilarity], result of:
            1.2071353 = score(doc=619,freq=1.0), product of:
              0.7440804 = queryWeight, product of:
                2.3214216 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.03704512 = queryNorm
              1.6223184 = fieldWeight in 619, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.1875 = fieldNorm(doc=619)
        0.27272728 = coord(3/11)
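
In the content-based scores above, the per-term weights are summed and then scaled by a coordination factor, coord(matched/total), where 11 is the number of query terms. A minimal sketch of that final step follows, using the term weights copied from the explain trees for docs 4088 and 619; the helper names are hypothetical.

```python
def coord(matched_terms: int, query_terms: int) -> float:
    # Fraction of query terms that occur in the document.
    return matched_terms / query_terms


def document_score(term_weights, query_terms):
    # Sum the per-term weights, then scale by the coordination factor.
    return sum(term_weights) * coord(len(term_weights), query_terms)


QUERY_TERMS = 11  # denominator shown in coord(n/11)

# Per-term weights copied from the explain output.
doc_4088 = [0.07409634, 0.07904172, 0.18817765,
            0.1742476, 0.20908275, 0.8047569]
doc_619 = [0.1111445, 0.31362414, 1.2071353]

score_4088 = document_score(doc_4088, QUERY_TERMS)
score_619 = document_score(doc_619, QUERY_TERMS)

print(f"doc 4088: {score_4088:.6f}")
print(f"doc 619:  {score_619:.6f}")
```

This shows why doc 619 ranks last despite having the largest raw sum (1.6319039): it matches only 3 of the 11 query terms, so its coordination factor is the smallest.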