Document (#33008)

Author
Singh, S.
Dey, L.
Title
A rough-fuzzy document grading system for customized text information retrieval
Source
Information processing and management. 41(2005) no.2, p.195-216
Year
2005
Abstract
The large repository of documents available on the web inundates users with a large volume of information, most of which is irrelevant. Since user perspectives vary, a client-side text filtering system that learns the user's perspective can reduce the problem of irrelevant retrieval. This paper presents the design of a customized text information filtering system that learns user preferences and modifies the initial query to fetch better documents. It uses a rough-fuzzy reasoning scheme: the rough-set-based reasoning handles natural language nuances such as synonyms, and the fuzzy decider assigns qualitative grades to the documents for the user's perusal. The paper details the design of the various modules and reports some results from a performance analysis of the system.
Theme
Retrieval algorithms

Similar documents (author)

  1. Singh, S.: From reference to information services : a study on impact of Ranganathan (1992) 5.32
    5.3242707 = sum of:
      5.3242707 = weight(author_txt:singh in 6697) [ClassicSimilarity], result of:
        5.3242707 = fieldWeight in 6697, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.518833 = idf(docFreq=23, maxDocs=44218)
          0.625 = fieldNorm(doc=6697)
    
  2. Singh, S.: S.R. Ranganathan : a review of centenary celebration events and literature (1994) 5.32
    5.3242707 = sum of:
      5.3242707 = weight(author_txt:singh in 693) [ClassicSimilarity], result of:
        5.3242707 = fieldWeight in 693, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.518833 = idf(docFreq=23, maxDocs=44218)
          0.625 = fieldNorm(doc=693)
    
  3. Singh, S.: A practical manual of Colon Classification : ed.7 (1990) 5.32
    5.3242707 = sum of:
      5.3242707 = weight(author_txt:singh in 2357) [ClassicSimilarity], result of:
        5.3242707 = fieldWeight in 2357, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.518833 = idf(docFreq=23, maxDocs=44218)
          0.625 = fieldNorm(doc=2357)
    
  4. Singh, A.K.: A review of the Universal Decimal Classification: International Medium Edition - English text (BS1000M:1993) (1995) 5.32
    5.3242707 = sum of:
      5.3242707 = weight(author_txt:singh in 6813) [ClassicSimilarity], result of:
        5.3242707 = fieldWeight in 6813, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.518833 = idf(docFreq=23, maxDocs=44218)
          0.625 = fieldNorm(doc=6813)
    
  5. Singh, S.: Ranganathan and reference services (1992) 5.32
    5.3242707 = sum of:
      5.3242707 = weight(author_txt:singh in 2517) [ClassicSimilarity], result of:
        5.3242707 = fieldWeight in 2517, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.518833 = idf(docFreq=23, maxDocs=44218)
          0.625 = fieldNorm(doc=2517)
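
The author-match scores above all reduce to the same arithmetic: fieldWeight = tf * idf * fieldNorm. The following is a minimal sketch of that calculation, assuming the standard Lucene ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are illustrative and are not part of the catalog software. Lucene computes in single precision, so the last digits may differ slightly from the listing.

```python
import math

def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as defined by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq: float) -> float:
    """Term frequency factor as defined by ClassicSimilarity."""
    return math.sqrt(freq)

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as shown in the explain trees."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values taken from the listing: author_txt:singh occurs once in the field,
# 23 documents contain the term, the index holds 44218 documents,
# and the stored field norm is 0.625.
score = field_weight(freq=1.0, doc_freq=23, max_docs=44218, field_norm=0.625)
print(f"{score:.4f}")
```

Because every entry in the author list has the same term frequency, document frequency, and field norm, all five receive the same score, which is why they all display 5.32.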
    

Similar documents (content)

  1. Miyamoto, S.: Application of rough sets to information retrieval (1998) 0.21
    0.20623052 = sum of:
      0.20623052 = product of:
        1.2889408 = sum of:
          0.029386653 = weight(abstract_txt:retrieval in 559) [ClassicSimilarity], result of:
            0.029386653 = score(doc=559,freq=3.0), product of:
              0.052077003 = queryWeight, product of:
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0149855865 = queryNorm
              0.5642923 = fieldWeight in 559, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=559)
          0.012168433 = weight(abstract_txt:information in 559) [ClassicSimilarity], result of:
            0.012168433 = score(doc=559,freq=2.0), product of:
              0.03791082 = queryWeight, product of:
                1.044971 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0149855865 = queryNorm
              0.32097518 = fieldWeight in 559, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=559)
          0.38894403 = weight(abstract_txt:fuzzy in 559) [ClassicSimilarity], result of:
            0.38894403 = score(doc=559,freq=4.0), product of:
              0.30305502 = queryWeight, product of:
                2.954496 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.0149855865 = queryNorm
              1.2834107 = fieldWeight in 559, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.09375 = fieldNorm(doc=559)
          0.8584417 = weight(abstract_txt:rough in 559) [ClassicSimilarity], result of:
            0.8584417 = score(doc=559,freq=6.0), product of:
              0.44878694 = queryWeight, product of:
                3.5953631 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.0149855865 = queryNorm
              1.9128046 = fieldWeight in 559, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.09375 = fieldNorm(doc=559)
        0.16 = coord(4/25)
    
  2. Shuldberg, H.K.; Macpherson, M.; Humphrey, P.: Distilling information from text : the EDS TemplateFiller system (1993) 0.17
    0.17482896 = sum of:
      0.17482896 = product of:
        0.5463405 = sum of:
          0.010140359 = weight(abstract_txt:information in 5642) [ClassicSimilarity], result of:
            0.010140359 = score(doc=5642,freq=2.0), product of:
              0.03791082 = queryWeight, product of:
                1.044971 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0149855865 = queryNorm
              0.2674793 = fieldWeight in 5642, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=5642)
          0.020305151 = weight(abstract_txt:design in 5642) [ClassicSimilarity], result of:
            0.020305151 = score(doc=5642,freq=1.0), product of:
              0.066289485 = queryWeight, product of:
                1.1282344 = boost
                3.9207718 = idf(docFreq=2382, maxDocs=44218)
                0.0149855865 = queryNorm
              0.3063103 = fieldWeight in 5642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9207718 = idf(docFreq=2382, maxDocs=44218)
                0.078125 = fieldNorm(doc=5642)
          0.029769275 = weight(abstract_txt:large in 5642) [ClassicSimilarity], result of:
            0.029769275 = score(doc=5642,freq=1.0), product of:
              0.08554986 = queryWeight, product of:
                1.2817008 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0149855865 = queryNorm
              0.34797573 = fieldWeight in 5642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.078125 = fieldNorm(doc=5642)
          0.04725941 = weight(abstract_txt:text in 5642) [ClassicSimilarity], result of:
            0.04725941 = score(doc=5642,freq=2.0), product of:
              0.105775826 = queryWeight, product of:
                1.7454839 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0149855865 = queryNorm
              0.44678837 = fieldWeight in 5642, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=5642)
          0.094144024 = weight(abstract_txt:filtering in 5642) [ClassicSimilarity], result of:
            0.094144024 = score(doc=5642,freq=1.0), product of:
              0.18431853 = queryWeight, product of:
                1.881315 = boost
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.0149855865 = queryNorm
              0.5107681 = fieldWeight in 5642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.078125 = fieldNorm(doc=5642)
          0.0516813 = weight(abstract_txt:system in 5642) [ClassicSimilarity], result of:
            0.0516813 = score(doc=5642,freq=4.0), product of:
              0.098081276 = queryWeight, product of:
                1.9408191 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0149855865 = queryNorm
              0.5269232 = fieldWeight in 5642, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.078125 = fieldNorm(doc=5642)
          0.13763918 = weight(abstract_txt:irrelevant in 5642) [ClassicSimilarity], result of:
            0.13763918 = score(doc=5642,freq=1.0), product of:
              0.2374298 = queryWeight, product of:
                2.1352298 = boost
                7.4202213 = idf(docFreq=71, maxDocs=44218)
                0.0149855865 = queryNorm
              0.57970476 = fieldWeight in 5642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4202213 = idf(docFreq=71, maxDocs=44218)
                0.078125 = fieldNorm(doc=5642)
          0.15540181 = weight(abstract_txt:customized in 5642) [ClassicSimilarity], result of:
            0.15540181 = score(doc=5642,freq=1.0), product of:
              0.2574411 = queryWeight, product of:
                2.2233915 = boost
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.0149855865 = queryNorm
              0.60364026 = fieldWeight in 5642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.078125 = fieldNorm(doc=5642)
        0.32 = coord(8/25)
    
  3. Knowledge management in fuzzy databases (2000) 0.12
    0.115752846 = sum of:
      0.115752846 = product of:
        0.7234553 = sum of:
          0.0239941 = weight(abstract_txt:retrieval in 4260) [ClassicSimilarity], result of:
            0.0239941 = score(doc=4260,freq=2.0), product of:
              0.052077003 = queryWeight, product of:
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0149855865 = queryNorm
              0.4607427 = fieldWeight in 4260, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=4260)
          0.012168433 = weight(abstract_txt:information in 4260) [ClassicSimilarity], result of:
            0.012168433 = score(doc=4260,freq=2.0), product of:
              0.03791082 = queryWeight, product of:
                1.044971 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0149855865 = queryNorm
              0.32097518 = fieldWeight in 4260, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=4260)
          0.3368354 = weight(abstract_txt:fuzzy in 4260) [ClassicSimilarity], result of:
            0.3368354 = score(doc=4260,freq=3.0), product of:
              0.30305502 = queryWeight, product of:
                2.954496 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.0149855865 = queryNorm
              1.1114662 = fieldWeight in 4260, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.09375 = fieldNorm(doc=4260)
          0.35045737 = weight(abstract_txt:rough in 4260) [ClassicSimilarity], result of:
            0.35045737 = score(doc=4260,freq=1.0), product of:
              0.44878694 = queryWeight, product of:
                3.5953631 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.0149855865 = queryNorm
              0.7808992 = fieldWeight in 4260, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.09375 = fieldNorm(doc=4260)
        0.16 = coord(4/25)
    
  4. Martin-Bautista, M.J.; Vila, M.-A.; Larsen, H.L.: A fuzzy genetic algorithm approach to an adaptive information retrieval agent (1999) 0.11
    0.11419638 = sum of:
      0.11419638 = product of:
        0.47581828 = sum of:
          0.014138659 = weight(abstract_txt:retrieval in 3914) [ClassicSimilarity], result of:
            0.014138659 = score(doc=3914,freq=1.0), product of:
              0.052077003 = queryWeight, product of:
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0149855865 = queryNorm
              0.27149525 = fieldWeight in 3914, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=3914)
          0.012419354 = weight(abstract_txt:information in 3914) [ClassicSimilarity], result of:
            0.012419354 = score(doc=3914,freq=3.0), product of:
              0.03791082 = queryWeight, product of:
                1.044971 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0149855865 = queryNorm
              0.32759392 = fieldWeight in 3914, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=3914)
          0.13296227 = weight(abstract_txt:user's in 3914) [ClassicSimilarity], result of:
            0.13296227 = score(doc=3914,freq=4.0), product of:
              0.14616367 = queryWeight, product of:
                1.675316 = boost
                5.8219566 = idf(docFreq=355, maxDocs=44218)
                0.0149855865 = queryNorm
              0.9096807 = fieldWeight in 3914, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8219566 = idf(docFreq=355, maxDocs=44218)
                0.078125 = fieldNorm(doc=3914)
          0.061269857 = weight(abstract_txt:documents in 3914) [ClassicSimilarity], result of:
            0.061269857 = score(doc=3914,freq=3.0), product of:
              0.109865606 = queryWeight, product of:
                1.7789081 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0149855865 = queryNorm
              0.5576801 = fieldWeight in 3914, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=3914)
          0.02584065 = weight(abstract_txt:system in 3914) [ClassicSimilarity], result of:
            0.02584065 = score(doc=3914,freq=1.0), product of:
              0.098081276 = queryWeight, product of:
                1.9408191 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0149855865 = queryNorm
              0.2634616 = fieldWeight in 3914, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.078125 = fieldNorm(doc=3914)
          0.22918746 = weight(abstract_txt:fuzzy in 3914) [ClassicSimilarity], result of:
            0.22918746 = score(doc=3914,freq=2.0), product of:
              0.30305502 = queryWeight, product of:
                2.954496 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.0149855865 = queryNorm
              0.75625694 = fieldWeight in 3914, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.078125 = fieldNorm(doc=3914)
        0.24 = coord(6/25)
    
  5. Torra, V.; Miyamoto, S.; Lanau, S.: Exploration of textual document archives using a fuzzy hierarchical clustering algorithm in the GAMBAL system (2005) 0.11
    0.10841158 = sum of:
      0.10841158 = product of:
        0.3871842 = sum of:
          0.014138659 = weight(abstract_txt:retrieval in 1028) [ClassicSimilarity], result of:
            0.014138659 = score(doc=1028,freq=1.0), product of:
              0.052077003 = queryWeight, product of:
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0149855865 = queryNorm
              0.27149525 = fieldWeight in 1028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=1028)
          0.010140359 = weight(abstract_txt:information in 1028) [ClassicSimilarity], result of:
            0.010140359 = score(doc=1028,freq=2.0), product of:
              0.03791082 = queryWeight, product of:
                1.044971 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0149855865 = queryNorm
              0.2674793 = fieldWeight in 1028, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=1028)
          0.016837949 = weight(abstract_txt:user in 1028) [ClassicSimilarity], result of:
            0.016837949 = score(doc=1028,freq=1.0), product of:
              0.058510426 = queryWeight, product of:
                1.0599701 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0149855865 = queryNorm
              0.2877769 = fieldWeight in 1028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.078125 = fieldNorm(doc=1028)
          0.029769275 = weight(abstract_txt:large in 1028) [ClassicSimilarity], result of:
            0.029769275 = score(doc=1028,freq=1.0), product of:
              0.08554986 = queryWeight, product of:
                1.2817008 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0149855865 = queryNorm
              0.34797573 = fieldWeight in 1028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.078125 = fieldNorm(doc=1028)
          0.061269857 = weight(abstract_txt:documents in 1028) [ClassicSimilarity], result of:
            0.061269857 = score(doc=1028,freq=3.0), product of:
              0.109865606 = queryWeight, product of:
                1.7789081 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0149855865 = queryNorm
              0.5576801 = fieldWeight in 1028, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=1028)
          0.02584065 = weight(abstract_txt:system in 1028) [ClassicSimilarity], result of:
            0.02584065 = score(doc=1028,freq=1.0), product of:
              0.098081276 = queryWeight, product of:
                1.9408191 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0149855865 = queryNorm
              0.2634616 = fieldWeight in 1028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.078125 = fieldNorm(doc=1028)
          0.22918746 = weight(abstract_txt:fuzzy in 1028) [ClassicSimilarity], result of:
            0.22918746 = score(doc=1028,freq=2.0), product of:
              0.30305502 = queryWeight, product of:
                2.954496 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.0149855865 = queryNorm
              0.75625694 = fieldWeight in 1028, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.078125 = fieldNorm(doc=1028)
        0.28 = coord(7/25)
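
The content-match scores above combine several matching terms. Each term contributes queryWeight * fieldWeight, where queryWeight = boost * idf * queryNorm and fieldWeight = tf * idf * fieldNorm, and the sum is scaled by coord, the fraction of the 25 query terms that matched. The sketch below recomputes the score of the first entry (doc 559) from the values in its explain tree. It assumes the ClassicSimilarity formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), and it assumes a boost of 1.0 for "retrieval", since no boost line is shown for that term. All names are illustrative.

```python
import math

MAX_DOCS = 44218          # maxDocs from the explain output
QUERY_NORM = 0.0149855865 # queryNorm from the explain output
QUERY_TERMS = 25          # denominator of coord(4/25)
FIELD_NORM = 0.09375      # fieldNorm(doc=559)

def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """ClassicSimilarity inverse document frequency."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    """Per-term score: queryWeight * fieldWeight."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight

# term -> (freq in doc 559, docFreq, boost); boost 1.0 for "retrieval" is assumed
matches = {
    "retrieval":   (3.0, 3720, 1.0),
    "information": (2.0, 10677, 1.044971),
    "fuzzy":       (4.0, 127, 2.954496),
    "rough":       (6.0, 28, 3.5953631),
}

per_term = {
    term: term_score(freq, doc_freq, boost, FIELD_NORM)
    for term, (freq, doc_freq, boost) in matches.items()
}
coord = len(matches) / QUERY_TERMS       # 4 of 25 query terms matched
total = coord * sum(per_term.values())   # final document score

for term, value in per_term.items():
    print(f"{term:12s} {value:.6f}")
print(f"total        {total:.6f}")
```

The rare terms dominate: "rough" (docFreq=28) and "fuzzy" (docFreq=127) account for most of the sum, while the common term "information" (docFreq=10677) contributes very little. The coord factor then penalizes documents that match only a few of the query terms.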