Document (#29133)

Author
Shen, D.
Chen, Z.
Yang, Q.
Zeng, H.J.
Zhang, B.
Lu, Y.
Ma, W.Y.
Title
Web page classification through summarization
Source
SIGIR'04: Proceedings of the 27th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval. Ed.: K. Järvelin et al.
Imprint
New York, NY : ACM Press
Year
2004
Pages
pp. 242-249
Theme
Automatic classification

Similar documents (author)

  1. Shen, D.; Yang, Q.; Chen, Z.: Noise reduction through summarization for Web-page classification (2007) 2.31
    2.3061898 = sum of:
      2.3061898 = product of:
        3.8436494 = sum of:
          0.7256452 = weight(author_txt:chen in 953) [ClassicSimilarity], result of:
            0.7256452 = score(doc=953,freq=1.0), product of:
              0.31455547 = queryWeight, product of:
                6.1517096 = idf(docFreq=255, maxDocs=44218)
                0.051133018 = queryNorm
              2.306891 = fieldWeight in 953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1517096 = idf(docFreq=255, maxDocs=44218)
                0.375 = fieldNorm(doc=953)
          1.1619982 = weight(author_txt:yang in 953) [ClassicSimilarity], result of:
            1.1619982 = score(doc=953,freq=1.0), product of:
              0.43054447 = queryWeight, product of:
                1.1699313 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.051133018 = queryNorm
              2.698904 = fieldWeight in 953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.375 = fieldNorm(doc=953)
          1.9560059 = weight(author_txt:shen in 953) [ClassicSimilarity], result of:
            1.9560059 = score(doc=953,freq=1.0), product of:
              0.6092485 = queryWeight, product of:
                1.3917096 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.051133018 = queryNorm
              3.2105222 = fieldWeight in 953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.375 = fieldNorm(doc=953)
        0.6 = coord(3/5)
    
  2. Zhang, J.; Zeng, M.L.: A new similarity measure for subject hierarchical structures (2014) 1.16
    1.1617616 = sum of:
      1.1617616 = product of:
        2.904404 = sum of:
          1.1017154 = weight(author_txt:zhang in 1778) [ClassicSimilarity], result of:
            1.1017154 = score(doc=1778,freq=1.0), product of:
              0.3430058 = queryWeight, product of:
                1.0442443 = boost
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.051133018 = queryNorm
              3.2119439 = fieldWeight in 1778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.5 = fieldNorm(doc=1778)
          1.8026885 = weight(author_txt:zeng in 1778) [ClassicSimilarity], result of:
            1.8026885 = score(doc=1778,freq=1.0), product of:
              0.47628728 = queryWeight, product of:
                1.230512 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.051133018 = queryNorm
              3.7848763 = fieldWeight in 1778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.5 = fieldNorm(doc=1778)
        0.4 = coord(2/5)
    
  3. Shen, X.-L.; Zhang, K.Z.K.; Zhao, S.J.: Herd behavior in consumers' adoption of online reviews (2016) 1.11
    1.112917 = sum of:
      1.112917 = product of:
        2.7822924 = sum of:
          0.82628655 = weight(author_txt:zhang in 3157) [ClassicSimilarity], result of:
            0.82628655 = score(doc=3157,freq=1.0), product of:
              0.3430058 = queryWeight, product of:
                1.0442443 = boost
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.051133018 = queryNorm
              2.408958 = fieldWeight in 3157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.375 = fieldNorm(doc=3157)
          1.9560059 = weight(author_txt:shen in 3157) [ClassicSimilarity], result of:
            1.9560059 = score(doc=3157,freq=1.0), product of:
              0.6092485 = queryWeight, product of:
                1.3917096 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.051133018 = queryNorm
              3.2105222 = fieldWeight in 3157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.375 = fieldNorm(doc=3157)
        0.4 = coord(2/5)
    
  4. Zeng, M.L.; Chen, Y.: Features of an integrated thesaurus management and search system for the networked environment (2003) 1.11
    1.1080862 = sum of:
      1.1080862 = product of:
        2.7702155 = sum of:
          0.9675269 = weight(author_txt:chen in 3817) [ClassicSimilarity], result of:
            0.9675269 = score(doc=3817,freq=1.0), product of:
              0.31455547 = queryWeight, product of:
                6.1517096 = idf(docFreq=255, maxDocs=44218)
                0.051133018 = queryNorm
              3.0758548 = fieldWeight in 3817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1517096 = idf(docFreq=255, maxDocs=44218)
                0.5 = fieldNorm(doc=3817)
          1.8026885 = weight(author_txt:zeng in 3817) [ClassicSimilarity], result of:
            1.8026885 = score(doc=3817,freq=1.0), product of:
              0.47628728 = queryWeight, product of:
                1.230512 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.051133018 = queryNorm
              3.7848763 = fieldWeight in 3817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.5 = fieldNorm(doc=3817)
        0.4 = coord(2/5)
    
  5. Zhang, M.; Yang, C.C.: Using content and network analysis to understand the social support exchange patterns and user behaviors of an online smoking cessation intervention program (2015) 1.06
    1.0604185 = sum of:
      1.0604185 = product of:
        2.6510463 = sum of:
          1.1017154 = weight(author_txt:zhang in 1668) [ClassicSimilarity], result of:
            1.1017154 = score(doc=1668,freq=1.0), product of:
              0.3430058 = queryWeight, product of:
                1.0442443 = boost
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.051133018 = queryNorm
              3.2119439 = fieldWeight in 1668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.5 = fieldNorm(doc=1668)
          1.549331 = weight(author_txt:yang in 1668) [ClassicSimilarity], result of:
            1.549331 = score(doc=1668,freq=1.0), product of:
              0.43054447 = queryWeight, product of:
                1.1699313 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.051133018 = queryNorm
              3.5985389 = fieldWeight in 1668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.5 = fieldNorm(doc=1668)
        0.4 = coord(2/5)
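
The author-similarity trees above follow the usual Lucene ClassicSimilarity (TF-IDF) arithmetic: idf = 1 + ln(maxDocs / (docFreq + 1)), tf = sqrt(freq), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and the summed term scores are scaled by coord (matched terms / query terms). The following is a minimal sketch that recomputes the score of hit 1 (doc 953) from the leaf values shown; the function names are hypothetical, and queryNorm, boost, and fieldNorm are copied from the listing rather than derived. Small differences in the last digits are to be expected, since Lucene works in 32-bit floats.

```python
import math

MAX_DOCS = 44218          # maxDocs, as shown in the listing
QUERY_NORM = 0.051133018  # queryNorm of the author query, copied from the listing


def idf(doc_freq, max_docs=MAX_DOCS):
    """ClassicSimilarity inverse document frequency."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm, boost=1.0, query_norm=QUERY_NORM):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# Hit 1 (doc 953): (freq, docFreq, fieldNorm, boost) for each matching author term
matches_953 = [
    (1.0, 255, 0.375, 1.0),        # author_txt:chen (no boost line in the listing)
    (1.0, 89, 0.375, 1.1699313),   # author_txt:yang
    (1.0, 22, 0.375, 1.3917096),   # author_txt:shen
]

raw_sum = sum(term_score(*m) for m in matches_953)
coord = 3 / 5                      # three of the five query author terms matched
score_953 = raw_sum * coord

print(f"sum of term scores: {raw_sum:.4f}")
print(f"score after coord:  {score_953:.4f}")
```

The result should land close to the 2.3061898 shown for hit 1. The same function applies to hits 2 to 5 with coord = 2/5 and the fieldNorm values listed for each document.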
    

Similar documents (content)

  1. Shen, D.; Yang, Q.; Chen, Z.: Noise reduction through summarization for Web-page classification (2007) 1.93
    1.9297643 = sum of:
      1.9297643 = sum of:
        0.18367028 = weight(abstract_txt:classification in 953) [ClassicSimilarity], result of:
          0.18367028 = score(doc=953,freq=6.0), product of:
            0.24042216 = queryWeight, product of:
              3.9920752 = idf(docFreq=2218, maxDocs=44218)
              0.060224857 = queryNorm
            0.76394904 = fieldWeight in 953, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.9920752 = idf(docFreq=2218, maxDocs=44218)
              0.078125 = fieldNorm(doc=953)
        0.07606501 = weight(abstract_txt:through in 953) [ClassicSimilarity], result of:
          0.07606501 = score(doc=953,freq=1.0), product of:
            0.24272934 = queryWeight, product of:
              1.0047867 = boost
              4.011184 = idf(docFreq=2176, maxDocs=44218)
              0.060224857 = queryNorm
            0.31337377 = fieldWeight in 953, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              4.011184 = idf(docFreq=2176, maxDocs=44218)
              0.078125 = fieldNorm(doc=953)
        0.622478 = weight(abstract_txt:page in 953) [ClassicSimilarity], result of:
          0.622478 = score(doc=953,freq=6.0), product of:
            0.5424561 = queryWeight, product of:
              1.5020869 = boost
              5.9964437 = idf(docFreq=298, maxDocs=44218)
              0.060224857 = queryNorm
            1.1475178 = fieldWeight in 953, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              5.9964437 = idf(docFreq=298, maxDocs=44218)
              0.078125 = fieldNorm(doc=953)
        1.047551 = weight(abstract_txt:summarization in 953) [ClassicSimilarity], result of:
          1.047551 = score(doc=953,freq=6.0), product of:
            0.767477 = queryWeight, product of:
              1.7866745 = boost
              7.132539 = idf(docFreq=95, maxDocs=44218)
              0.060224857 = queryNorm
            1.3649282 = fieldWeight in 953, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              7.132539 = idf(docFreq=95, maxDocs=44218)
              0.078125 = fieldNorm(doc=953)
    
  2. Huo, W.: Automatic multi-word term extraction and its application to Web-page summarization (2012) 0.88
    0.88132596 = sum of:
      0.88132596 = product of:
        1.1751013 = sum of:
          0.074983075 = weight(abstract_txt:classification in 563) [ClassicSimilarity], result of:
            0.074983075 = score(doc=563,freq=1.0), product of:
              0.24042216 = queryWeight, product of:
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.060224857 = queryNorm
              0.3118809 = fieldWeight in 563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=563)
          0.35938784 = weight(abstract_txt:page in 563) [ClassicSimilarity], result of:
            0.35938784 = score(doc=563,freq=2.0), product of:
              0.5424561 = queryWeight, product of:
                1.5020869 = boost
                5.9964437 = idf(docFreq=298, maxDocs=44218)
                0.060224857 = queryNorm
              0.6625197 = fieldWeight in 563, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9964437 = idf(docFreq=298, maxDocs=44218)
                0.078125 = fieldNorm(doc=563)
          0.74073035 = weight(abstract_txt:summarization in 563) [ClassicSimilarity], result of:
            0.74073035 = score(doc=563,freq=3.0), product of:
              0.767477 = queryWeight, product of:
                1.7866745 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.060224857 = queryNorm
              0.96514994 = fieldWeight in 563, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.078125 = fieldNorm(doc=563)
        0.75 = coord(3/4)
    
  3. Balas, J.: Dewey and the net (1996) 0.44
    0.43545285 = sum of:
      0.43545285 = product of:
        0.8709057 = sum of:
          0.15213002 = weight(abstract_txt:through in 4704) [ClassicSimilarity], result of:
            0.15213002 = score(doc=4704,freq=1.0), product of:
              0.24272934 = queryWeight, product of:
                1.0047867 = boost
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.060224857 = queryNorm
              0.62674755 = fieldWeight in 4704, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.15625 = fieldNorm(doc=4704)
          0.7187757 = weight(abstract_txt:page in 4704) [ClassicSimilarity], result of:
            0.7187757 = score(doc=4704,freq=2.0), product of:
              0.5424561 = queryWeight, product of:
                1.5020869 = boost
                5.9964437 = idf(docFreq=298, maxDocs=44218)
                0.060224857 = queryNorm
              1.3250394 = fieldWeight in 4704, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9964437 = idf(docFreq=298, maxDocs=44218)
                0.15625 = fieldNorm(doc=4704)
        0.5 = coord(2/4)
    
  4. Over, P.; Dang, H.; Harman, D.: DUC in context (2007) 0.35
    0.35260814 = sum of:
      0.35260814 = product of:
        0.7052163 = sum of:
          0.10649101 = weight(abstract_txt:through in 934) [ClassicSimilarity], result of:
            0.10649101 = score(doc=934,freq=1.0), product of:
              0.24272934 = queryWeight, product of:
                1.0047867 = boost
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.060224857 = queryNorm
              0.43872327 = fieldWeight in 934, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.109375 = fieldNorm(doc=934)
          0.59872526 = weight(abstract_txt:summarization in 934) [ClassicSimilarity], result of:
            0.59872526 = score(doc=934,freq=1.0), product of:
              0.767477 = queryWeight, product of:
                1.7866745 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.060224857 = queryNorm
              0.78012145 = fieldWeight in 934, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.109375 = fieldNorm(doc=934)
        0.5 = coord(2/4)
    
  5. Bar-Ilan, J.: What do we know about links and linking? : a framework for studying links in academic environments (2005) 0.35
    0.34504882 = sum of:
      0.34504882 = product of:
        0.4600651 = sum of:
          0.12987448 = weight(abstract_txt:classification in 1058) [ClassicSimilarity], result of:
            0.12987448 = score(doc=1058,freq=3.0), product of:
              0.24042216 = queryWeight, product of:
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.060224857 = queryNorm
              0.5401935 = fieldWeight in 1058, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=1058)
          0.07606501 = weight(abstract_txt:through in 1058) [ClassicSimilarity], result of:
            0.07606501 = score(doc=1058,freq=1.0), product of:
              0.24272934 = queryWeight, product of:
                1.0047867 = boost
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.060224857 = queryNorm
              0.31337377 = fieldWeight in 1058, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.078125 = fieldNorm(doc=1058)
          0.2541256 = weight(abstract_txt:page in 1058) [ClassicSimilarity], result of:
            0.2541256 = score(doc=1058,freq=1.0), product of:
              0.5424561 = queryWeight, product of:
                1.5020869 = boost
                5.9964437 = idf(docFreq=298, maxDocs=44218)
                0.060224857 = queryNorm
              0.46847218 = fieldWeight in 1058, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9964437 = idf(docFreq=298, maxDocs=44218)
                0.078125 = fieldNorm(doc=1058)
        0.75 = coord(3/4)
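
The content-similarity trees differ from the author ones mainly in that term frequencies above 1 occur (tf = sqrt(freq), so freq=6.0 gives 2.4494898) and that hit 1 matches all four query terms, which is why its tree has no coord line. As a rough sketch under the same ClassicSimilarity assumptions (idf = 1 + ln(maxDocs / (docFreq + 1))), the snippet below recomputes hits 1 and 2 from the listed values. The dictionary layout and the function name `score_doc` are illustrative only; boosts, docFreq, queryNorm, and fieldNorm are copied from the listing.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.060224857  # queryNorm of the content query, copied from the listing

# Query terms on abstract_txt: term -> (docFreq, boost), as shown in the listing
QUERY = {
    "classification": (2218, 1.0),
    "through": (2176, 1.0047867),
    "page": (298, 1.5020869),
    "summarization": (95, 1.7866745),
}


def score_doc(term_freqs, field_norm):
    """Sum queryWeight * fieldWeight over matching terms, then apply coord."""
    total = 0.0
    for term, freq in term_freqs.items():
        doc_freq, boost = QUERY[term]
        term_idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
        query_weight = boost * term_idf * QUERY_NORM
        field_weight = math.sqrt(freq) * term_idf * field_norm
        total += query_weight * field_weight
    coord = len(term_freqs) / len(QUERY)  # matched terms / query terms
    return total * coord


# Hit 1 (doc 953): all four terms match, so coord = 4/4 = 1
hit1 = score_doc(
    {"classification": 6.0, "through": 1.0, "page": 6.0, "summarization": 6.0},
    field_norm=0.078125,
)

# Hit 2 (doc 563): three of four terms match, so coord = 3/4
hit2 = score_doc(
    {"classification": 1.0, "page": 2.0, "summarization": 3.0},
    field_norm=0.078125,
)

print(f"doc 953: {hit1:.4f}")
print(f"doc 563: {hit2:.4f}")
```

The two values should come out near the 1.9297643 and 0.88132596 shown for hits 1 and 2, up to 32-bit rounding in the original output.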