Document (#19518)

Author
Mock, K.J.
Vemuri, V.R.
Title
Information filtering via hill climbing, WordNet, and index patterns
Source
Information processing and management. 33(1997) no.5, S.633-644
Year
1997
Abstract
The INFOS (Intelligent News Filtering Organizational System) project is designed to reduce the user's search burden by automatically categorising data as relevant or irrelevant based upon user interests. These predictions are learned automatically based upon features taken from input articles and collaborative features derived from other users. The filtering is performed by a hybrid technique that combines elements of a keyword-based hill climbing method, knowledge-based conceptual representation via WordNet, and partial parsing via index patterns. The hybrid systems integrating all these approaches combines the benefits of each while maintaing robustness and acalability
Footnote
Contribution to a special issue devoted to electronic newspapers
Theme
Computerlinguistik
Object
WordNet

Similar documents (content)

  1. Chandrasekar, R.; Srinivas, B.: Automatic induction of rules for text simplification (1997) 0.13
    0.13180126 = sum of:
      0.13180126 = product of:
        0.5491719 = sum of:
          0.016981618 = weight(abstract_txt:these in 2873) [ClassicSimilarity], result of:
            0.016981618 = score(doc=2873,freq=1.0), product of:
              0.056816157 = queryWeight, product of:
                1.0194724 = boost
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.017480766 = queryNorm
              0.29888713 = fieldWeight in 2873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.09375 = fieldNorm(doc=2873)
          0.08581057 = weight(abstract_txt:partial in 2873) [ClassicSimilarity], result of:
            0.08581057 = score(doc=2873,freq=1.0), product of:
              0.13279131 = queryWeight, product of:
                1.1020705 = boost
                6.892866 = idf(docFreq=121, maxDocs=44218)
                0.017480766 = queryNorm
              0.6462062 = fieldWeight in 2873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.892866 = idf(docFreq=121, maxDocs=44218)
                0.09375 = fieldNorm(doc=2873)
          0.121762566 = weight(abstract_txt:parsing in 2873) [ClassicSimilarity], result of:
            0.121762566 = score(doc=2873,freq=1.0), product of:
              0.16768144 = queryWeight, product of:
                1.2384174 = boost
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.017480766 = queryNorm
              0.7261541 = fieldWeight in 2873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.09375 = fieldNorm(doc=2873)
          0.12444133 = weight(abstract_txt:automatically in 2873) [ClassicSimilarity], result of:
            0.12444133 = score(doc=2873,freq=2.0), product of:
              0.17013183 = queryWeight, product of:
                1.764137 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.017480766 = queryNorm
              0.7314406 = fieldWeight in 2873, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.09375 = fieldNorm(doc=2873)
          0.048022155 = weight(abstract_txt:based in 2873) [ClassicSimilarity], result of:
            0.048022155 = score(doc=2873,freq=2.0), product of:
              0.11361794 = queryWeight, product of:
                2.0388157 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.017480766 = queryNorm
              0.42266348 = fieldWeight in 2873, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.09375 = fieldNorm(doc=2873)
          0.15215369 = weight(abstract_txt:combines in 2873) [ClassicSimilarity], result of:
            0.15215369 = score(doc=2873,freq=1.0), product of:
              0.24509859 = queryWeight, product of:
                2.1174343 = boost
                6.6217136 = idf(docFreq=159, maxDocs=44218)
                0.017480766 = queryNorm
              0.62078565 = fieldWeight in 2873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6217136 = idf(docFreq=159, maxDocs=44218)
                0.09375 = fieldNorm(doc=2873)
        0.24 = coord(6/25)
    
  2. Cullen, C.: Verity agent technology : automatic filtering, matching and dissemination of information (1996) 0.12
    0.1169289 = sum of:
      0.1169289 = product of:
        0.73080564 = sum of:
          0.07479274 = weight(abstract_txt:intelligent in 2415) [ClassicSimilarity], result of:
            0.07479274 = score(doc=2415,freq=1.0), product of:
              0.10933292 = queryWeight, product of:
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.017480766 = queryNorm
              0.68408257 = fieldWeight in 2415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.109375 = fieldNorm(doc=2415)
          0.2970765 = weight(abstract_txt:categorising in 2415) [ClassicSimilarity], result of:
            0.2970765 = score(doc=2415,freq=1.0), product of:
              0.27421433 = queryWeight, product of:
                1.5836879 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.017480766 = queryNorm
              1.0833733 = fieldWeight in 2415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.109375 = fieldNorm(doc=2415)
          0.10265887 = weight(abstract_txt:automatically in 2415) [ClassicSimilarity], result of:
            0.10265887 = score(doc=2415,freq=1.0), product of:
              0.17013183 = queryWeight, product of:
                1.764137 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.017480766 = queryNorm
              0.60340774 = fieldWeight in 2415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.109375 = fieldNorm(doc=2415)
          0.25627756 = weight(abstract_txt:filtering in 2415) [ClassicSimilarity], result of:
            0.25627756 = score(doc=2415,freq=1.0), product of:
              0.35839236 = queryWeight, product of:
                3.1359167 = boost
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.017480766 = queryNorm
              0.7150754 = fieldWeight in 2415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.109375 = fieldNorm(doc=2415)
        0.16 = coord(4/25)
    
  3. Ma, W.-Y.; Manjunath, B.S.: ¬A texture thesaurus for browsing large aerial photographs (1998) 0.11
    0.11223936 = sum of:
      0.11223936 = product of:
        0.467664 = sum of:
          0.09329397 = weight(abstract_txt:robustness in 874) [ClassicSimilarity], result of:
            0.09329397 = score(doc=874,freq=1.0), product of:
              0.18398075 = queryWeight, product of:
                1.2972113 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.017480766 = queryNorm
              0.5070855 = fieldWeight in 874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.0625 = fieldNorm(doc=874)
          0.046208203 = weight(abstract_txt:features in 874) [ClassicSimilarity], result of:
            0.046208203 = score(doc=874,freq=2.0), product of:
              0.115172654 = queryWeight, product of:
                1.4514905 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.017480766 = queryNorm
              0.4012081 = fieldWeight in 874, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.0625 = fieldNorm(doc=874)
          0.05138273 = weight(abstract_txt:patterns in 874) [ClassicSimilarity], result of:
            0.05138273 = score(doc=874,freq=1.0), product of:
              0.15574883 = queryWeight, product of:
                1.6879202 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.017480766 = queryNorm
              0.32990766 = fieldWeight in 874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0625 = fieldNorm(doc=874)
          0.02263786 = weight(abstract_txt:based in 874) [ClassicSimilarity], result of:
            0.02263786 = score(doc=874,freq=1.0), product of:
              0.11361794 = queryWeight, product of:
                2.0388157 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.017480766 = queryNorm
              0.19924548 = fieldWeight in 874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=874)
          0.10769692 = weight(abstract_txt:hybrid in 874) [ClassicSimilarity], result of:
            0.10769692 = score(doc=874,freq=1.0), product of:
              0.25508338 = queryWeight, product of:
                2.1601336 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.017480766 = queryNorm
              0.4222028 = fieldWeight in 874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.0625 = fieldNorm(doc=874)
          0.1464443 = weight(abstract_txt:filtering in 874) [ClassicSimilarity], result of:
            0.1464443 = score(doc=874,freq=1.0), product of:
              0.35839236 = queryWeight, product of:
                3.1359167 = boost
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.017480766 = queryNorm
              0.4086145 = fieldWeight in 874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.0625 = fieldNorm(doc=874)
        0.24 = coord(6/25)
    
  4. Liu, D.-R.; Shih, M.-J.: Hybrid-patent classification based on patent-network analysis (2011) 0.09
    0.09135991 = sum of:
      0.09135991 = product of:
        0.45679957 = sum of:
          0.07625895 = weight(abstract_txt:predictions in 4189) [ClassicSimilarity], result of:
            0.07625895 = score(doc=4189,freq=1.0), product of:
              0.16084117 = queryWeight, product of:
                1.2128948 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.017480766 = queryNorm
              0.47412583 = fieldWeight in 4189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0625 = fieldNorm(doc=4189)
          0.032674134 = weight(abstract_txt:features in 4189) [ClassicSimilarity], result of:
            0.032674134 = score(doc=4189,freq=1.0), product of:
              0.115172654 = queryWeight, product of:
                1.4514905 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.017480766 = queryNorm
              0.28369698 = fieldWeight in 4189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.0625 = fieldNorm(doc=4189)
          0.05989415 = weight(abstract_txt:based in 4189) [ClassicSimilarity], result of:
            0.05989415 = score(doc=4189,freq=7.0), product of:
              0.11361794 = queryWeight, product of:
                2.0388157 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.017480766 = queryNorm
              0.52715397 = fieldWeight in 4189, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=4189)
          0.101435795 = weight(abstract_txt:combines in 4189) [ClassicSimilarity], result of:
            0.101435795 = score(doc=4189,freq=1.0), product of:
              0.24509859 = queryWeight, product of:
                2.1174343 = boost
                6.6217136 = idf(docFreq=159, maxDocs=44218)
                0.017480766 = queryNorm
              0.4138571 = fieldWeight in 4189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6217136 = idf(docFreq=159, maxDocs=44218)
                0.0625 = fieldNorm(doc=4189)
          0.18653654 = weight(abstract_txt:hybrid in 4189) [ClassicSimilarity], result of:
            0.18653654 = score(doc=4189,freq=3.0), product of:
              0.25508338 = queryWeight, product of:
                2.1601336 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.017480766 = queryNorm
              0.7312767 = fieldWeight in 4189, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.0625 = fieldNorm(doc=4189)
        0.2 = coord(5/25)
    
  5. Mostafa, J.; Quiroga, L.M.; Palakal, M.: Filtering medical documents using automated and human classification methods (1998) 0.09
    0.090266995 = sum of:
      0.090266995 = product of:
        0.56416875 = sum of:
          0.014151349 = weight(abstract_txt:these in 2326) [ClassicSimilarity], result of:
            0.014151349 = score(doc=2326,freq=1.0), product of:
              0.056816157 = queryWeight, product of:
                1.0194724 = boost
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.017480766 = queryNorm
              0.24907261 = fieldWeight in 2326, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.078125 = fieldNorm(doc=2326)
          0.07332776 = weight(abstract_txt:automatically in 2326) [ClassicSimilarity], result of:
            0.07332776 = score(doc=2326,freq=1.0), product of:
              0.17013183 = queryWeight, product of:
                1.764137 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.017480766 = queryNorm
              0.4310055 = fieldWeight in 2326, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.078125 = fieldNorm(doc=2326)
          0.028297326 = weight(abstract_txt:based in 2326) [ClassicSimilarity], result of:
            0.028297326 = score(doc=2326,freq=1.0), product of:
              0.11361794 = queryWeight, product of:
                2.0388157 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.017480766 = queryNorm
              0.24905685 = fieldWeight in 2326, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.078125 = fieldNorm(doc=2326)
          0.4483923 = weight(abstract_txt:filtering in 2326) [ClassicSimilarity], result of:
            0.4483923 = score(doc=2326,freq=6.0), product of:
              0.35839236 = queryWeight, product of:
                3.1359167 = boost
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.017480766 = queryNorm
              1.2511213 = fieldWeight in 2326, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.078125 = fieldNorm(doc=2326)
        0.16 = coord(4/25)