Document (#22858)

Author
Watters, C.
Wang, H.
Title
Rating new documents for similarity
Source
Journal of the American Society for Information Science. 51(2000) no.9, S.793-804
Year
2000
Abstract
Electronic news has long held the promise of personalized and dynamic delivery of current event new items, particularly for Web users. Although wlwctronic versions of print news are now widely available, the personalization of that delivery has not yet been accomplished. In this paper, we present a methodology of associating news documents based on the extraction of feature phrases, where feature phrases identify dates, locations, people and organizations. A news representation is created from these feature phrases to define news objects that can then be compared and ranked to find related news items. Unlike tradtional information retrieval, we are much more interested in precision than recall. That is, the user would like to see one or more specifically related articles, rather than all somewhat related articles. The algorithm is designed to work interactively the the user using regular web browsers as the interface
Theme
Internet
Form
Zeitungen

Similar documents (author)

  1. Watters, C.: Extending the multimedia class hierarchy for hypermedia applications (1996) 2.42
    2.422723 = sum of:
      2.422723 = product of:
        4.845446 = sum of:
          4.845446 = weight(author_txt:watters in 606) [ClassicSimilarity], result of:
            4.845446 = score(doc=606,freq=1.0), product of:
              0.86674464 = queryWeight, product of:
                1.3182664 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.07350644 = queryNorm
              5.5903964 = fieldWeight in 606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.625 = fieldNorm(doc=606)
        0.5 = coord(1/2)
    
  2. Watters, C.: Information retrieval and the virtual document (1999) 2.42
    2.422723 = sum of:
      2.422723 = product of:
        4.845446 = sum of:
          4.845446 = weight(author_txt:watters in 5320) [ClassicSimilarity], result of:
            4.845446 = score(doc=5320,freq=1.0), product of:
              0.86674464 = queryWeight, product of:
                1.3182664 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.07350644 = queryNorm
              5.5903964 = fieldWeight in 5320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.625 = fieldNorm(doc=5320)
        0.5 = coord(1/2)
    
  3. Watters, C.; Shepherd, M.A.: Shifting the information paradigm from data-centered to user-centered (1994) 1.94
    1.9381785 = sum of:
      1.9381785 = product of:
        3.876357 = sum of:
          3.876357 = weight(author_txt:watters in 7290) [ClassicSimilarity], result of:
            3.876357 = score(doc=7290,freq=1.0), product of:
              0.86674464 = queryWeight, product of:
                1.3182664 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.07350644 = queryNorm
              4.472317 = fieldWeight in 7290, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.5 = fieldNorm(doc=7290)
        0.5 = coord(1/2)
    
  4. Carrick, C.; Watters, C.: Automatic association of news items (1997) 1.94
    1.9381785 = sum of:
      1.9381785 = product of:
        3.876357 = sum of:
          3.876357 = weight(author_txt:watters in 2550) [ClassicSimilarity], result of:
            3.876357 = score(doc=2550,freq=1.0), product of:
              0.86674464 = queryWeight, product of:
                1.3182664 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.07350644 = queryNorm
              4.472317 = fieldWeight in 2550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.5 = fieldNorm(doc=2550)
        0.5 = coord(1/2)
    
  5. Watters, C.; Amoudi, A.: Geosearcher : location-based ranking of search engine results (2003) 1.94
    1.9381785 = sum of:
      1.9381785 = product of:
        3.876357 = sum of:
          3.876357 = weight(author_txt:watters in 153) [ClassicSimilarity], result of:
            3.876357 = score(doc=153,freq=1.0), product of:
              0.86674464 = queryWeight, product of:
                1.3182664 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.07350644 = queryNorm
              4.472317 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.5 = fieldNorm(doc=153)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Sela, M.; Lavie, T.; Inbar, O.; Oppenheim, I.; Meyer, J.: Personalizing news content : an experimental study (2015) 0.34
    0.33798844 = sum of:
      0.33798844 = product of:
        1.2071016 = sum of:
          0.00962782 = weight(abstract_txt:that in 3605) [ClassicSimilarity], result of:
            0.00962782 = score(doc=3605,freq=2.0), product of:
              0.045294344 = queryWeight, product of:
                1.0163068 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.018532338 = queryNorm
              0.2125612 = fieldWeight in 3605, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=3605)
          0.15522103 = weight(abstract_txt:personalized in 3605) [ClassicSimilarity], result of:
            0.15522103 = score(doc=3605,freq=6.0), product of:
              0.13896695 = queryWeight, product of:
                1.0277748 = boost
                7.295975 = idf(docFreq=77, maxDocs=42306)
                0.018532338 = queryNorm
              1.1169636 = fieldWeight in 3605, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.295975 = idf(docFreq=77, maxDocs=42306)
                0.0625 = fieldNorm(doc=3605)
          0.03249507 = weight(abstract_txt:user in 3605) [ClassicSimilarity], result of:
            0.03249507 = score(doc=3605,freq=4.0), product of:
              0.070663735 = queryWeight, product of:
                1.0364671 = boost
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.018532338 = queryNorm
              0.459855 = fieldWeight in 3605, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.0625 = fieldNorm(doc=3605)
          0.077505454 = weight(abstract_txt:personalization in 3605) [ClassicSimilarity], result of:
            0.077505454 = score(doc=3605,freq=1.0), product of:
              0.15893386 = queryWeight, product of:
                1.0991335 = boost
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.018532338 = queryNorm
              0.48765853 = fieldWeight in 3605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.0625 = fieldNorm(doc=3605)
          0.12691027 = weight(abstract_txt:items in 3605) [ClassicSimilarity], result of:
            0.12691027 = score(doc=3605,freq=5.0), product of:
              0.16268447 = queryWeight, product of:
                1.5726434 = boost
                5.5819464 = idf(docFreq=432, maxDocs=42306)
                0.018532338 = queryNorm
              0.7801007 = fieldWeight in 3605, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.5819464 = idf(docFreq=432, maxDocs=42306)
                0.0625 = fieldNorm(doc=3605)
          0.0712464 = weight(abstract_txt:delivery in 3605) [ClassicSimilarity], result of:
            0.0712464 = score(doc=3605,freq=1.0), product of:
              0.1893129 = queryWeight, product of:
                1.6964744 = boost
                6.0214725 = idf(docFreq=278, maxDocs=42306)
                0.018532338 = queryNorm
              0.37634203 = fieldWeight in 3605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0214725 = idf(docFreq=278, maxDocs=42306)
                0.0625 = fieldNorm(doc=3605)
          0.73409563 = weight(abstract_txt:news in 3605) [ClassicSimilarity], result of:
            0.73409563 = score(doc=3605,freq=11.0), product of:
              0.58132124 = queryWeight, product of:
                5.149036 = boost
                6.0920024 = idf(docFreq=259, maxDocs=42306)
                0.018532338 = queryNorm
              1.2628055 = fieldWeight in 3605, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                6.0920024 = idf(docFreq=259, maxDocs=42306)
                0.0625 = fieldNorm(doc=3605)
        0.28 = coord(7/25)
    
  2. Watters, C.R.; Shepherd, M.A.; Burkowski, F.J.: Electronic news delivery project (1998) 0.20
    0.1995766 = sum of:
      0.1995766 = product of:
        0.997883 = sum of:
          0.0068078972 = weight(abstract_txt:that in 1445) [ClassicSimilarity], result of:
            0.0068078972 = score(doc=1445,freq=1.0), product of:
              0.045294344 = queryWeight, product of:
                1.0163068 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.018532338 = queryNorm
              0.15030347 = fieldWeight in 1445, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=1445)
          0.063368715 = weight(abstract_txt:personalized in 1445) [ClassicSimilarity], result of:
            0.063368715 = score(doc=1445,freq=1.0), product of:
              0.13896695 = queryWeight, product of:
                1.0277748 = boost
                7.295975 = idf(docFreq=77, maxDocs=42306)
                0.018532338 = queryNorm
              0.45599845 = fieldWeight in 1445, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.295975 = idf(docFreq=77, maxDocs=42306)
                0.0625 = fieldNorm(doc=1445)
          0.12340239 = weight(abstract_txt:delivery in 1445) [ClassicSimilarity], result of:
            0.12340239 = score(doc=1445,freq=3.0), product of:
              0.1893129 = queryWeight, product of:
                1.6964744 = boost
                6.0214725 = idf(docFreq=278, maxDocs=42306)
                0.018532338 = queryNorm
              0.6518435 = fieldWeight in 1445, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.0214725 = idf(docFreq=278, maxDocs=42306)
                0.0625 = fieldNorm(doc=1445)
          0.03756621 = weight(abstract_txt:related in 1445) [ClassicSimilarity], result of:
            0.03756621 = score(doc=1445,freq=1.0), product of:
              0.14143828 = queryWeight, product of:
                1.7959172 = boost
                4.2496233 = idf(docFreq=1640, maxDocs=42306)
                0.018532338 = queryNorm
              0.26560146 = fieldWeight in 1445, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2496233 = idf(docFreq=1640, maxDocs=42306)
                0.0625 = fieldNorm(doc=1445)
          0.7667378 = weight(abstract_txt:news in 1445) [ClassicSimilarity], result of:
            0.7667378 = score(doc=1445,freq=12.0), product of:
              0.58132124 = queryWeight, product of:
                5.149036 = boost
                6.0920024 = idf(docFreq=259, maxDocs=42306)
                0.018532338 = queryNorm
              1.3189572 = fieldWeight in 1445, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.0920024 = idf(docFreq=259, maxDocs=42306)
                0.0625 = fieldNorm(doc=1445)
        0.2 = coord(5/25)
    
  3. Shapira, B.; Shoval, P.; Tractinsky, N.; Meyer, J.: ePaper : a personalized mobile newspaper (2009) 0.19
    0.18927076 = sum of:
      0.18927076 = product of:
        0.9463538 = sum of:
          0.07921089 = weight(abstract_txt:personalized in 169) [ClassicSimilarity], result of:
            0.07921089 = score(doc=169,freq=1.0), product of:
              0.13896695 = queryWeight, product of:
                1.0277748 = boost
                7.295975 = idf(docFreq=77, maxDocs=42306)
                0.018532338 = queryNorm
              0.5699981 = fieldWeight in 169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.295975 = idf(docFreq=77, maxDocs=42306)
                0.078125 = fieldNorm(doc=169)
          0.028721856 = weight(abstract_txt:user in 169) [ClassicSimilarity], result of:
            0.028721856 = score(doc=169,freq=2.0), product of:
              0.070663735 = queryWeight, product of:
                1.0364671 = boost
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.018532338 = queryNorm
              0.40645823 = fieldWeight in 169, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.078125 = fieldNorm(doc=169)
          0.09688182 = weight(abstract_txt:personalization in 169) [ClassicSimilarity], result of:
            0.09688182 = score(doc=169,freq=1.0), product of:
              0.15893386 = queryWeight, product of:
                1.0991335 = boost
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.018532338 = queryNorm
              0.6095732 = fieldWeight in 169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.078125 = fieldNorm(doc=169)
          0.12288034 = weight(abstract_txt:items in 169) [ClassicSimilarity], result of:
            0.12288034 = score(doc=169,freq=3.0), product of:
              0.16268447 = queryWeight, product of:
                1.5726434 = boost
                5.5819464 = idf(docFreq=432, maxDocs=42306)
                0.018532338 = queryNorm
              0.75532925 = fieldWeight in 169, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5819464 = idf(docFreq=432, maxDocs=42306)
                0.078125 = fieldNorm(doc=169)
          0.6186589 = weight(abstract_txt:news in 169) [ClassicSimilarity], result of:
            0.6186589 = score(doc=169,freq=5.0), product of:
              0.58132124 = queryWeight, product of:
                5.149036 = boost
                6.0920024 = idf(docFreq=259, maxDocs=42306)
                0.018532338 = queryNorm
              1.064229 = fieldWeight in 169, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.0920024 = idf(docFreq=259, maxDocs=42306)
                0.078125 = fieldNorm(doc=169)
        0.2 = coord(5/25)
    
  4. Carrick, C.; Watters, C.: Automatic association of news items (1997) 0.19
    0.18897586 = sum of:
      0.18897586 = product of:
        0.94487923 = sum of:
          0.103182495 = weight(abstract_txt:event in 2550) [ClassicSimilarity], result of:
            0.103182495 = score(doc=2550,freq=2.0), product of:
              0.1315575 = queryWeight, product of:
                7.0988073 = idf(docFreq=94, maxDocs=42306)
                0.018532338 = queryNorm
              0.7843148 = fieldWeight in 2550, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0988073 = idf(docFreq=94, maxDocs=42306)
                0.078125 = fieldNorm(doc=2550)
          0.01473953 = weight(abstract_txt:that in 2550) [ClassicSimilarity], result of:
            0.01473953 = score(doc=2550,freq=3.0), product of:
              0.045294344 = queryWeight, product of:
                1.0163068 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.018532338 = queryNorm
              0.32541656 = fieldWeight in 2550, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.078125 = fieldNorm(doc=2550)
          0.14189 = weight(abstract_txt:items in 2550) [ClassicSimilarity], result of:
            0.14189 = score(doc=2550,freq=4.0), product of:
              0.16268447 = queryWeight, product of:
                1.5726434 = boost
                5.5819464 = idf(docFreq=432, maxDocs=42306)
                0.018532338 = queryNorm
              0.87217915 = fieldWeight in 2550, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5819464 = idf(docFreq=432, maxDocs=42306)
                0.078125 = fieldNorm(doc=2550)
          0.06640831 = weight(abstract_txt:related in 2550) [ClassicSimilarity], result of:
            0.06640831 = score(doc=2550,freq=2.0), product of:
              0.14143828 = queryWeight, product of:
                1.7959172 = boost
                4.2496233 = idf(docFreq=1640, maxDocs=42306)
                0.018532338 = queryNorm
              0.46952146 = fieldWeight in 2550, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2496233 = idf(docFreq=1640, maxDocs=42306)
                0.078125 = fieldNorm(doc=2550)
          0.6186589 = weight(abstract_txt:news in 2550) [ClassicSimilarity], result of:
            0.6186589 = score(doc=2550,freq=5.0), product of:
              0.58132124 = queryWeight, product of:
                5.149036 = boost
                6.0920024 = idf(docFreq=259, maxDocs=42306)
                0.018532338 = queryNorm
              1.064229 = fieldWeight in 2550, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.0920024 = idf(docFreq=259, maxDocs=42306)
                0.078125 = fieldNorm(doc=2550)
        0.2 = coord(5/25)
    
  5. Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.18
    0.18195988 = sum of:
      0.18195988 = product of:
        0.9097994 = sum of:
          0.15442942 = weight(abstract_txt:event in 1783) [ClassicSimilarity], result of:
            0.15442942 = score(doc=1783,freq=7.0), product of:
              0.1315575 = queryWeight, product of:
                7.0988073 = idf(docFreq=94, maxDocs=42306)
                0.018532338 = queryNorm
              1.173855 = fieldWeight in 1783, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.0988073 = idf(docFreq=94, maxDocs=42306)
                0.0625 = fieldNorm(doc=1783)
          0.00962782 = weight(abstract_txt:that in 1783) [ClassicSimilarity], result of:
            0.00962782 = score(doc=1783,freq=2.0), product of:
              0.045294344 = queryWeight, product of:
                1.0163068 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.018532338 = queryNorm
              0.2125612 = fieldWeight in 1783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=1783)
          0.022977486 = weight(abstract_txt:user in 1783) [ClassicSimilarity], result of:
            0.022977486 = score(doc=1783,freq=2.0), product of:
              0.070663735 = queryWeight, product of:
                1.0364671 = boost
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.018532338 = queryNorm
              0.32516658 = fieldWeight in 1783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.0625 = fieldNorm(doc=1783)
          0.09672582 = weight(abstract_txt:articles in 1783) [ClassicSimilarity], result of:
            0.09672582 = score(doc=1783,freq=7.0), product of:
              0.12133903 = queryWeight, product of:
                1.3581804 = boost
                4.8207307 = idf(docFreq=926, maxDocs=42306)
                0.018532338 = queryNorm
              0.7971534 = fieldWeight in 1783, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.8207307 = idf(docFreq=926, maxDocs=42306)
                0.0625 = fieldNorm(doc=1783)
          0.62603885 = weight(abstract_txt:news in 1783) [ClassicSimilarity], result of:
            0.62603885 = score(doc=1783,freq=8.0), product of:
              0.58132124 = queryWeight, product of:
                5.149036 = boost
                6.0920024 = idf(docFreq=259, maxDocs=42306)
                0.018532338 = queryNorm
              1.0769241 = fieldWeight in 1783, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.0920024 = idf(docFreq=259, maxDocs=42306)
                0.0625 = fieldNorm(doc=1783)
        0.2 = coord(5/25)