Document (#22858)

Author
Watters, C.
Wang, H.
Title
Rating news documents for similarity
Source
Journal of the American Society for Information Science. 51(2000) no.9, p.793-804
Year
2000
Abstract
Electronic news has long held the promise of personalized and dynamic delivery of current event news items, particularly for Web users. Although electronic versions of print news are now widely available, the personalization of that delivery has not yet been accomplished. In this paper, we present a methodology of associating news documents based on the extraction of feature phrases, where feature phrases identify dates, locations, people and organizations. A news representation is created from these feature phrases to define news objects that can then be compared and ranked to find related news items. Unlike traditional information retrieval, we are much more interested in precision than recall. That is, the user would like to see one or more specifically related articles, rather than all somewhat related articles. The algorithm is designed to work interactively with the user using regular web browsers as the interface.
Theme
Internet
Form
Newspapers

Similar documents (author)

  1. Watters, C.: Extending the multimedia class hierarchy for hypermedia applications (1996) 2.44
    2.4420898 = sum of:
      2.4420898 = product of:
        4.8841796 = sum of:
          4.8841796 = weight(author_txt:watters in 1606) [ClassicSimilarity], result of:
            4.8841796 = score(doc=1606,freq=1.0), product of:
              0.87151396 = queryWeight, product of:
                1.3331373 = boost
                8.966795 = idf(docFreq=14, maxDocs=43254)
                0.07290582 = queryNorm
              5.604247 = fieldWeight in 1606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.966795 = idf(docFreq=14, maxDocs=43254)
                0.625 = fieldNorm(doc=1606)
        0.5 = coord(1/2)
    
  2. Watters, C.: Information retrieval and the virtual document (1999) 2.44
    2.4420898 = sum of:
      2.4420898 = product of:
        4.8841796 = sum of:
          4.8841796 = weight(author_txt:watters in 6320) [ClassicSimilarity], result of:
            4.8841796 = score(doc=6320,freq=1.0), product of:
              0.87151396 = queryWeight, product of:
                1.3331373 = boost
                8.966795 = idf(docFreq=14, maxDocs=43254)
                0.07290582 = queryNorm
              5.604247 = fieldWeight in 6320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.966795 = idf(docFreq=14, maxDocs=43254)
                0.625 = fieldNorm(doc=6320)
        0.5 = coord(1/2)
    
  3. Watters, C.; Shepherd, M.A.: Shifting the information paradigm from data-centered to user-centered (1994) 1.95
    1.9536718 = sum of:
      1.9536718 = product of:
        3.9073436 = sum of:
          3.9073436 = weight(author_txt:watters in 290) [ClassicSimilarity], result of:
            3.9073436 = score(doc=290,freq=1.0), product of:
              0.87151396 = queryWeight, product of:
                1.3331373 = boost
                8.966795 = idf(docFreq=14, maxDocs=43254)
                0.07290582 = queryNorm
              4.4833975 = fieldWeight in 290, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.966795 = idf(docFreq=14, maxDocs=43254)
                0.5 = fieldNorm(doc=290)
        0.5 = coord(1/2)
    
  4. Carrick, C.; Watters, C.: Automatic association of news items (1997) 1.95
    1.9536718 = sum of:
      1.9536718 = product of:
        3.9073436 = sum of:
          3.9073436 = weight(author_txt:watters in 3550) [ClassicSimilarity], result of:
            3.9073436 = score(doc=3550,freq=1.0), product of:
              0.87151396 = queryWeight, product of:
                1.3331373 = boost
                8.966795 = idf(docFreq=14, maxDocs=43254)
                0.07290582 = queryNorm
              4.4833975 = fieldWeight in 3550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.966795 = idf(docFreq=14, maxDocs=43254)
                0.5 = fieldNorm(doc=3550)
        0.5 = coord(1/2)
    
  5. Watters, C.; Amoudi, A.: Geosearcher : location-based ranking of search engine results (2003) 1.95
    1.9536718 = sum of:
      1.9536718 = product of:
        3.9073436 = sum of:
          3.9073436 = weight(author_txt:watters in 153) [ClassicSimilarity], result of:
            3.9073436 = score(doc=153,freq=1.0), product of:
              0.87151396 = queryWeight, product of:
                1.3331373 = boost
                8.966795 = idf(docFreq=14, maxDocs=43254)
                0.07290582 = queryNorm
              4.4833975 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.966795 = idf(docFreq=14, maxDocs=43254)
                0.5 = fieldNorm(doc=153)
        0.5 = coord(1/2)
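
The score breakdowns above can be reproduced by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the values listed. The function names are hypothetical and chosen for illustration only. All inputs are copied from the first entry (doc 1606).

```python
import math

def classic_idf(doc_freq, max_docs):
    # ClassicSimilarity inverse document frequency.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq):
    # ClassicSimilarity term frequency: square root of the raw count.
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as in the explain tree.
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm
    field_weight = classic_tf(freq) * idf * field_norm
    return query_weight * field_weight

# Values taken from the explain output for author_txt:watters in doc 1606.
raw = term_score(freq=1.0, doc_freq=14, max_docs=43254,
                 boost=1.3331373, query_norm=0.07290582, field_norm=0.625)

# coord(1/2): one of the two query clauses matched.
final = raw * (1 / 2)
print(round(raw, 4), round(final, 4))
```

Entries 3 to 5 differ from entries 1 and 2 only in fieldNorm (0.5 instead of 0.625), which accounts for their lower score of 1.95 against 2.44.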
    

Similar documents (content)

  1. Sela, M.; Lavie, T.; Inbar, O.; Oppenheim, I.; Meyer, J.: Personalizing news content : an experimental study (2015) 0.33
    0.3328107 = sum of:
      0.3328107 = product of:
        1.1886096 = sum of:
          0.009497967 = weight(abstract_txt:that in 3069) [ClassicSimilarity], result of:
            0.009497967 = score(doc=3069,freq=2.0), product of:
              0.04504758 = queryWeight, product of:
                1.0150892 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.018603863 = queryNorm
              0.210843 = fieldWeight in 3069, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0625 = fieldNorm(doc=3069)
          0.15357408 = weight(abstract_txt:personalized in 3069) [ClassicSimilarity], result of:
            0.15357408 = score(doc=3069,freq=6.0), product of:
              0.13847843 = queryWeight, product of:
                1.0275403 = boost
                7.244028 = idf(docFreq=83, maxDocs=43254)
                0.018603863 = queryNorm
              1.1090108 = fieldWeight in 3069, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.244028 = idf(docFreq=83, maxDocs=43254)
                0.0625 = fieldNorm(doc=3069)
          0.032856595 = weight(abstract_txt:user in 3069) [ClassicSimilarity], result of:
            0.032856595 = score(doc=3069,freq=4.0), product of:
              0.07144289 = queryWeight, product of:
                1.043764 = boost
                3.6792014 = idf(docFreq=2967, maxDocs=43254)
                0.018603863 = queryNorm
              0.45990017 = fieldWeight in 3069, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6792014 = idf(docFreq=2967, maxDocs=43254)
                0.0625 = fieldNorm(doc=3069)
          0.07715441 = weight(abstract_txt:personalization in 3069) [ClassicSimilarity], result of:
            0.07715441 = score(doc=3069,freq=1.0), product of:
              0.15902343 = queryWeight, product of:
                1.1011294 = boost
                7.762822 = idf(docFreq=49, maxDocs=43254)
                0.018603863 = queryNorm
              0.48517638 = fieldWeight in 3069, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.762822 = idf(docFreq=49, maxDocs=43254)
                0.0625 = fieldNorm(doc=3069)
          0.1287071 = weight(abstract_txt:items in 3069) [ClassicSimilarity], result of:
            0.1287071 = score(doc=3069,freq=5.0), product of:
              0.16480698 = queryWeight, product of:
                1.5852969 = boost
                5.5880704 = idf(docFreq=439, maxDocs=43254)
                0.018603863 = queryNorm
              0.78095657 = fieldWeight in 3069, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.5880704 = idf(docFreq=439, maxDocs=43254)
                0.0625 = fieldNorm(doc=3069)
          0.07243 = weight(abstract_txt:delivery in 3069) [ClassicSimilarity], result of:
            0.07243 = score(doc=3069,freq=1.0), product of:
              0.19209214 = queryWeight, product of:
                1.7115028 = boost
                6.032938 = idf(docFreq=281, maxDocs=43254)
                0.018603863 = queryNorm
              0.37705863 = fieldWeight in 3069, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.032938 = idf(docFreq=281, maxDocs=43254)
                0.0625 = fieldNorm(doc=3069)
          0.71438944 = weight(abstract_txt:news in 3069) [ClassicSimilarity], result of:
            0.71438944 = score(doc=3069,freq=11.0), product of:
              0.5729237 = queryWeight, product of:
                5.1195507 = boost
                6.0153627 = idf(docFreq=286, maxDocs=43254)
                0.018603863 = queryNorm
              1.2469189 = fieldWeight in 3069, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                6.0153627 = idf(docFreq=286, maxDocs=43254)
                0.0625 = fieldNorm(doc=3069)
        0.28 = coord(7/25)
    
  2. Watters, C.R.; Shepherd, M.A.; Burkowski, F.J.: Electronic news delivery project (1998) 0.20
    0.19566308 = sum of:
      0.19566308 = product of:
        0.97831535 = sum of:
          0.006716077 = weight(abstract_txt:that in 2445) [ClassicSimilarity], result of:
            0.006716077 = score(doc=2445,freq=1.0), product of:
              0.04504758 = queryWeight, product of:
                1.0150892 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.018603863 = queryNorm
              0.14908852 = fieldWeight in 2445, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0625 = fieldNorm(doc=2445)
          0.06269635 = weight(abstract_txt:personalized in 2445) [ClassicSimilarity], result of:
            0.06269635 = score(doc=2445,freq=1.0), product of:
              0.13847843 = queryWeight, product of:
                1.0275403 = boost
                7.244028 = idf(docFreq=83, maxDocs=43254)
                0.018603863 = queryNorm
              0.45275176 = fieldWeight in 2445, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.244028 = idf(docFreq=83, maxDocs=43254)
                0.0625 = fieldNorm(doc=2445)
          0.12545243 = weight(abstract_txt:delivery in 2445) [ClassicSimilarity], result of:
            0.12545243 = score(doc=2445,freq=3.0), product of:
              0.19209214 = queryWeight, product of:
                1.7115028 = boost
                6.032938 = idf(docFreq=281, maxDocs=43254)
                0.018603863 = queryNorm
              0.6530847 = fieldWeight in 2445, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.032938 = idf(docFreq=281, maxDocs=43254)
                0.0625 = fieldNorm(doc=2445)
          0.037295092 = weight(abstract_txt:related in 2445) [ClassicSimilarity], result of:
            0.037295092 = score(doc=2445,freq=1.0), product of:
              0.14126313 = queryWeight, product of:
                1.7975577 = boost
                4.224184 = idf(docFreq=1720, maxDocs=43254)
                0.018603863 = queryNorm
              0.2640115 = fieldWeight in 2445, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.224184 = idf(docFreq=1720, maxDocs=43254)
                0.0625 = fieldNorm(doc=2445)
          0.7461554 = weight(abstract_txt:news in 2445) [ClassicSimilarity], result of:
            0.7461554 = score(doc=2445,freq=12.0), product of:
              0.5729237 = queryWeight, product of:
                5.1195507 = boost
                6.0153627 = idf(docFreq=286, maxDocs=43254)
                0.018603863 = queryNorm
              1.3023642 = fieldWeight in 2445, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.0153627 = idf(docFreq=286, maxDocs=43254)
                0.0625 = fieldNorm(doc=2445)
        0.2 = coord(5/25)
    
  3. Shapira, B.; Shoval, P.; Tractinsky, N.; Meyer, J.: ePaper : a personalized mobile newspaper (2009) 0.19
    0.1861053 = sum of:
      0.1861053 = product of:
        0.9305265 = sum of:
          0.07837044 = weight(abstract_txt:personalized in 169) [ClassicSimilarity], result of:
            0.07837044 = score(doc=169,freq=1.0), product of:
              0.13847843 = queryWeight, product of:
                1.0275403 = boost
                7.244028 = idf(docFreq=83, maxDocs=43254)
                0.018603863 = queryNorm
              0.56593966 = fieldWeight in 169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.244028 = idf(docFreq=83, maxDocs=43254)
                0.078125 = fieldNorm(doc=169)
          0.029041402 = weight(abstract_txt:user in 169) [ClassicSimilarity], result of:
            0.029041402 = score(doc=169,freq=2.0), product of:
              0.07144289 = queryWeight, product of:
                1.043764 = boost
                3.6792014 = idf(docFreq=2967, maxDocs=43254)
                0.018603863 = queryNorm
              0.40649816 = fieldWeight in 169, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6792014 = idf(docFreq=2967, maxDocs=43254)
                0.078125 = fieldNorm(doc=169)
          0.09644301 = weight(abstract_txt:personalization in 169) [ClassicSimilarity], result of:
            0.09644301 = score(doc=169,freq=1.0), product of:
              0.15902343 = queryWeight, product of:
                1.1011294 = boost
                7.762822 = idf(docFreq=49, maxDocs=43254)
                0.018603863 = queryNorm
              0.60647047 = fieldWeight in 169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.762822 = idf(docFreq=49, maxDocs=43254)
                0.078125 = fieldNorm(doc=169)
          0.1246201 = weight(abstract_txt:items in 169) [ClassicSimilarity], result of:
            0.1246201 = score(doc=169,freq=3.0), product of:
              0.16480698 = queryWeight, product of:
                1.5852969 = boost
                5.5880704 = idf(docFreq=439, maxDocs=43254)
                0.018603863 = queryNorm
              0.75615793 = fieldWeight in 169, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5880704 = idf(docFreq=439, maxDocs=43254)
                0.078125 = fieldNorm(doc=169)
          0.6020515 = weight(abstract_txt:news in 169) [ClassicSimilarity], result of:
            0.6020515 = score(doc=169,freq=5.0), product of:
              0.5729237 = queryWeight, product of:
                5.1195507 = boost
                6.0153627 = idf(docFreq=286, maxDocs=43254)
                0.018603863 = queryNorm
              1.0508406 = fieldWeight in 169, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.0153627 = idf(docFreq=286, maxDocs=43254)
                0.078125 = fieldNorm(doc=169)
        0.2 = coord(5/25)
    
  4. Carrick, C.; Watters, C.: Automatic association of news items (1997) 0.19
    0.18571556 = sum of:
      0.18571556 = product of:
        0.9285778 = sum of:
          0.1021576 = weight(abstract_txt:event in 3550) [ClassicSimilarity], result of:
            0.1021576 = score(doc=3550,freq=2.0), product of:
              0.13115487 = queryWeight, product of:
                7.0498724 = idf(docFreq=101, maxDocs=43254)
                0.018603863 = queryNorm
              0.7789082 = fieldWeight in 3550, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0498724 = idf(docFreq=101, maxDocs=43254)
                0.078125 = fieldNorm(doc=3550)
          0.014540733 = weight(abstract_txt:that in 3550) [ClassicSimilarity], result of:
            0.014540733 = score(doc=3550,freq=3.0), product of:
              0.04504758 = queryWeight, product of:
                1.0150892 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.018603863 = queryNorm
              0.3227861 = fieldWeight in 3550, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.078125 = fieldNorm(doc=3550)
          0.1438989 = weight(abstract_txt:items in 3550) [ClassicSimilarity], result of:
            0.1438989 = score(doc=3550,freq=4.0), product of:
              0.16480698 = queryWeight, product of:
                1.5852969 = boost
                5.5880704 = idf(docFreq=439, maxDocs=43254)
                0.018603863 = queryNorm
              0.873136 = fieldWeight in 3550, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5880704 = idf(docFreq=439, maxDocs=43254)
                0.078125 = fieldNorm(doc=3550)
          0.065929025 = weight(abstract_txt:related in 3550) [ClassicSimilarity], result of:
            0.065929025 = score(doc=3550,freq=2.0), product of:
              0.14126313 = queryWeight, product of:
                1.7975577 = boost
                4.224184 = idf(docFreq=1720, maxDocs=43254)
                0.018603863 = queryNorm
              0.4667108 = fieldWeight in 3550, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.224184 = idf(docFreq=1720, maxDocs=43254)
                0.078125 = fieldNorm(doc=3550)
          0.6020515 = weight(abstract_txt:news in 3550) [ClassicSimilarity], result of:
            0.6020515 = score(doc=3550,freq=5.0), product of:
              0.5729237 = queryWeight, product of:
                5.1195507 = boost
                6.0153627 = idf(docFreq=286, maxDocs=43254)
                0.018603863 = queryNorm
              1.0508406 = fieldWeight in 3550, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.0153627 = idf(docFreq=286, maxDocs=43254)
                0.078125 = fieldNorm(doc=3550)
        0.2 = coord(5/25)
    
  5. Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.18
    0.17835842 = sum of:
      0.17835842 = product of:
        0.8917921 = sum of:
          0.1528955 = weight(abstract_txt:event in 1783) [ClassicSimilarity], result of:
            0.1528955 = score(doc=1783,freq=7.0), product of:
              0.13115487 = queryWeight, product of:
                7.0498724 = idf(docFreq=101, maxDocs=43254)
                0.018603863 = queryNorm
              1.165763 = fieldWeight in 1783, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.0498724 = idf(docFreq=101, maxDocs=43254)
                0.0625 = fieldNorm(doc=1783)
          0.009497967 = weight(abstract_txt:that in 1783) [ClassicSimilarity], result of:
            0.009497967 = score(doc=1783,freq=2.0), product of:
              0.04504758 = queryWeight, product of:
                1.0150892 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.018603863 = queryNorm
              0.210843 = fieldWeight in 1783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0625 = fieldNorm(doc=1783)
          0.023233121 = weight(abstract_txt:user in 1783) [ClassicSimilarity], result of:
            0.023233121 = score(doc=1783,freq=2.0), product of:
              0.07144289 = queryWeight, product of:
                1.043764 = boost
                3.6792014 = idf(docFreq=2967, maxDocs=43254)
                0.018603863 = queryNorm
              0.32519853 = fieldWeight in 1783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6792014 = idf(docFreq=2967, maxDocs=43254)
                0.0625 = fieldNorm(doc=1783)
          0.096932225 = weight(abstract_txt:articles in 1783) [ClassicSimilarity], result of:
            0.096932225 = score(doc=1783,freq=7.0), product of:
              0.12194857 = queryWeight, product of:
                1.3636758 = boost
                4.8068705 = idf(docFreq=960, maxDocs=43254)
                0.018603863 = queryNorm
              0.7948615 = fieldWeight in 1783, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.8068705 = idf(docFreq=960, maxDocs=43254)
                0.0625 = fieldNorm(doc=1783)
          0.6092333 = weight(abstract_txt:news in 1783) [ClassicSimilarity], result of:
            0.6092333 = score(doc=1783,freq=8.0), product of:
              0.5729237 = queryWeight, product of:
                5.1195507 = boost
                6.0153627 = idf(docFreq=286, maxDocs=43254)
                0.018603863 = queryNorm
              1.063376 = fieldWeight in 1783, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.0153627 = idf(docFreq=286, maxDocs=43254)
                0.0625 = fieldNorm(doc=1783)
        0.2 = coord(5/25)
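
For the content-based list, each score is the sum of the per-term scores multiplied by the coord factor (matching terms over query terms). The following is a minimal sketch of that arithmetic for entry 3 (doc 169), under the same ClassicSimilarity assumptions (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))). The function name and the tuple layout are hypothetical. The numbers are copied from the explain output above.

```python
import math

MAX_DOCS = 43254
QUERY_NORM = 0.018603863   # queryNorm shared by all abstract_txt terms
FIELD_NORM = 0.078125      # fieldNorm(doc=169)

# (term, freq in doc 169, docFreq, boost), as listed for doc 169.
matched_terms = [
    ("personalized",    1.0,   83, 1.0275403),
    ("user",            2.0, 2967, 1.043764),
    ("personalization", 1.0,   49, 1.1011294),
    ("items",           3.0,  439, 1.5852969),
    ("news",            5.0,  286, 5.1195507),
]

def document_score(terms, query_term_count):
    total = 0.0
    for _term, freq, doc_freq, boost in terms:
        idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
        query_weight = boost * idf * QUERY_NORM
        field_weight = math.sqrt(freq) * idf * FIELD_NORM
        total += query_weight * field_weight
    # coord rewards documents that match more of the query terms.
    coord = len(terms) / query_term_count
    return total * coord

score_169 = document_score(matched_terms, query_term_count=25)
print(round(score_169, 4))
```

The coord factor explains why entry 1 ranks well ahead of the rest: it matches 7 of the 25 query terms (coord 0.28), while the other four entries match 5 (coord 0.2).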