Document (#25062)

Author
Larouk, O.
Title
Modelling users need : schemas of interrogation and filtering of answers from the WEB in co-operative mode
Source
Structures and relations in knowledge organization: Proceedings of the 5th International ISKO-Conference, Lille, 25-29 August 1998. Ed.: W. Mustafa el Hadi et al.
Imprint
Würzburg : Ergon
Year
1998
Pages
pp. 106-115
Series
Advances in knowledge organization; vol.6
Abstract
Textual analysis is a part of information processing systems. Search engines facilitate access to digital data on Web servers. In response to a request, the user is presented with a long list of Web page references. Selecting the relevant documents efficiently is very difficult because precision in the list is low. Typically, the user visits the first page referenced in the list but never consults the hundredth. Because it is difficult to assess the pertinence of all the references obtained, the searcher needs tools to filter the list. The aim of this paper is to suggest a filtering method based on URLs, titles and abstracts. This filtering enables the searcher to build a set of pages and so improve on the initial search formulation. The process falls within the scope of modeling the user's profile as a means of improving access to more relevant information. It uses classification algorithms to extract the more relevant 'terms' in titles and abstracts, drawing on the texts that the user interactively accepts or rejects during filtering. The problem of searching for information in texts is mainly linguistic. The objective is to construct an automatic indexing system using the Noun Phrase (NP) model. The intensional predicate/NP instances are built from the retrieval, navigation and filtering of the references captured from the Web. The questions now posed are: Can they play the role of descriptors in textual databases? How should they be organized in a documentary indexing system for future information retrieval?

Similar documents (content)

  1. García Cumbreras, M.A.; Perea-Ortega, J.M.; García Vega, M.; Ureña López, L.A.: Information retrieval with geographical references : relevant documents filtering vs. query expansion (2009) 0.18
    0.1770225 = sum of:
      0.1770225 = product of:
        0.8851125 = sum of:
          0.018545844 = weight(abstract_txt:information in 1223) [ClassicSimilarity], result of:
            0.018545844 = score(doc=1223,freq=2.0), product of:
              0.05742581 = queryWeight, product of:
                1.1710215 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.020132098 = queryNorm
              0.3229531 = fieldWeight in 1223, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.09375 = fieldNorm(doc=1223)
          0.057056673 = weight(abstract_txt:improve in 1223) [ClassicSimilarity], result of:
            0.057056673 = score(doc=1223,freq=1.0), product of:
              0.12147316 = queryWeight, product of:
                1.2043049 = boost
                5.010197 = idf(docFreq=766, maxDocs=42306)
                0.020132098 = queryNorm
              0.469706 = fieldWeight in 1223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.010197 = idf(docFreq=766, maxDocs=42306)
                0.09375 = fieldNorm(doc=1223)
          0.11946211 = weight(abstract_txt:relevant in 1223) [ClassicSimilarity], result of:
            0.11946211 = score(doc=1223,freq=3.0), product of:
              0.15779243 = queryWeight, product of:
                1.681067 = boost
                4.662428 = idf(docFreq=1085, maxDocs=42306)
                0.020132098 = queryNorm
              0.7570839 = fieldWeight in 1223, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.662428 = idf(docFreq=1085, maxDocs=42306)
                0.09375 = fieldNorm(doc=1223)
          0.1418749 = weight(abstract_txt:list in 1223) [ClassicSimilarity], result of:
            0.1418749 = score(doc=1223,freq=1.0), product of:
              0.280903 = queryWeight, product of:
                2.5899386 = boost
                5.387383 = idf(docFreq=525, maxDocs=42306)
                0.020132098 = queryNorm
              0.50506717 = fieldWeight in 1223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.387383 = idf(docFreq=525, maxDocs=42306)
                0.09375 = fieldNorm(doc=1223)
          0.548173 = weight(abstract_txt:filtering in 1223) [ClassicSimilarity], result of:
            0.548173 = score(doc=1223,freq=3.0), product of:
              0.5166075 = queryWeight, product of:
                3.926871 = boost
                6.5346904 = idf(docFreq=166, maxDocs=42306)
                0.020132098 = queryNorm
              1.0611014 = fieldWeight in 1223, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5346904 = idf(docFreq=166, maxDocs=42306)
                0.09375 = fieldNorm(doc=1223)
        0.2 = coord(5/25)
    
  2. Mukhopadhyay, S.; Peng, S.; Raje, R.; Mostafa, J.; Palakal, M.: Distributed multi-agent information filtering : a comparative study (2005) 0.15
    0.15461196 = sum of:
      0.15461196 = product of:
        0.7730598 = sum of:
          0.012407168 = weight(abstract_txt:from in 4560) [ClassicSimilarity], result of:
            0.012407168 = score(doc=4560,freq=1.0), product of:
              0.05678179 = queryWeight, product of:
                1.0084317 = boost
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.020132098 = queryNorm
              0.2185061 = fieldWeight in 4560, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.078125 = fieldNorm(doc=4560)
          0.028913414 = weight(abstract_txt:information in 4560) [ClassicSimilarity], result of:
            0.028913414 = score(doc=4560,freq=7.0), product of:
              0.05742581 = queryWeight, product of:
                1.1710215 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.020132098 = queryNorm
              0.50349164 = fieldWeight in 4560, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.078125 = fieldNorm(doc=4560)
          0.028234797 = weight(abstract_txt:user in 4560) [ClassicSimilarity], result of:
            0.028234797 = score(doc=4560,freq=1.0), product of:
              0.09823896 = queryWeight, product of:
                1.3264284 = boost
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.020132098 = queryNorm
              0.28740937 = fieldWeight in 4560, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.078125 = fieldNorm(doc=4560)
          0.057476237 = weight(abstract_txt:relevant in 4560) [ClassicSimilarity], result of:
            0.057476237 = score(doc=4560,freq=1.0), product of:
              0.15779243 = queryWeight, product of:
                1.681067 = boost
                4.662428 = idf(docFreq=1085, maxDocs=42306)
                0.020132098 = queryNorm
              0.36425218 = fieldWeight in 4560, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.662428 = idf(docFreq=1085, maxDocs=42306)
                0.078125 = fieldNorm(doc=4560)
          0.64602816 = weight(abstract_txt:filtering in 4560) [ClassicSimilarity], result of:
            0.64602816 = score(doc=4560,freq=6.0), product of:
              0.5166075 = queryWeight, product of:
                3.926871 = boost
                6.5346904 = idf(docFreq=166, maxDocs=42306)
                0.020132098 = queryNorm
              1.2505202 = fieldWeight in 4560, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.5346904 = idf(docFreq=166, maxDocs=42306)
                0.078125 = fieldNorm(doc=4560)
        0.2 = coord(5/25)
    
  3. Furner, J.: On Recommending (2002) 0.14
    0.14190689 = sum of:
      0.14190689 = product of:
        0.5068103 = sum of:
          0.009925734 = weight(abstract_txt:from in 244) [ClassicSimilarity], result of:
            0.009925734 = score(doc=244,freq=1.0), product of:
              0.05678179 = queryWeight, product of:
                1.0084317 = boost
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.020132098 = queryNorm
              0.17480488 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.0625 = fieldNorm(doc=244)
          0.012363895 = weight(abstract_txt:information in 244) [ClassicSimilarity], result of:
            0.012363895 = score(doc=244,freq=2.0), product of:
              0.05742581 = queryWeight, product of:
                1.1710215 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.020132098 = queryNorm
              0.21530207 = fieldWeight in 244, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.0625 = fieldNorm(doc=244)
          0.03912328 = weight(abstract_txt:user in 244) [ClassicSimilarity], result of:
            0.03912328 = score(doc=244,freq=3.0), product of:
              0.09823896 = queryWeight, product of:
                1.3264284 = boost
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.020132098 = queryNorm
              0.3982461 = fieldWeight in 244, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.0625 = fieldNorm(doc=244)
          0.09384131 = weight(abstract_txt:searcher in 244) [ClassicSimilarity], result of:
            0.09384131 = score(doc=244,freq=1.0), product of:
              0.22178538 = queryWeight, product of:
                1.6272818 = boost
                6.769882 = idf(docFreq=131, maxDocs=42306)
                0.020132098 = queryNorm
              0.42311764 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.769882 = idf(docFreq=131, maxDocs=42306)
                0.0625 = fieldNorm(doc=244)
          0.04598099 = weight(abstract_txt:relevant in 244) [ClassicSimilarity], result of:
            0.04598099 = score(doc=244,freq=1.0), product of:
              0.15779243 = queryWeight, product of:
                1.681067 = boost
                4.662428 = idf(docFreq=1085, maxDocs=42306)
                0.020132098 = queryNorm
              0.29140174 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.662428 = idf(docFreq=1085, maxDocs=42306)
                0.0625 = fieldNorm(doc=244)
          0.09458326 = weight(abstract_txt:list in 244) [ClassicSimilarity], result of:
            0.09458326 = score(doc=244,freq=1.0), product of:
              0.280903 = queryWeight, product of:
                2.5899386 = boost
                5.387383 = idf(docFreq=525, maxDocs=42306)
                0.020132098 = queryNorm
              0.33671144 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.387383 = idf(docFreq=525, maxDocs=42306)
                0.0625 = fieldNorm(doc=244)
          0.21099189 = weight(abstract_txt:filtering in 244) [ClassicSimilarity], result of:
            0.21099189 = score(doc=244,freq=1.0), product of:
              0.5166075 = queryWeight, product of:
                3.926871 = boost
                6.5346904 = idf(docFreq=166, maxDocs=42306)
                0.020132098 = queryNorm
              0.40841815 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5346904 = idf(docFreq=166, maxDocs=42306)
                0.0625 = fieldNorm(doc=244)
        0.28 = coord(7/25)
    
  4. Elovici, Y.; Shapira, Y.B.; Kantor, P.B.: A decision theoretic approach to combining information filters : an analytical and empirical evaluation (2006) 0.14
    0.13814965 = sum of:
      0.13814965 = product of:
        0.6907482 = sum of:
          0.014888601 = weight(abstract_txt:from in 268) [ClassicSimilarity], result of:
            0.014888601 = score(doc=268,freq=1.0), product of:
              0.05678179 = queryWeight, product of:
                1.0084317 = boost
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.020132098 = queryNorm
              0.26220733 = fieldWeight in 268, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.09375 = fieldNorm(doc=268)
          0.022713928 = weight(abstract_txt:information in 268) [ClassicSimilarity], result of:
            0.022713928 = score(doc=268,freq=3.0), product of:
              0.05742581 = queryWeight, product of:
                1.1710215 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.020132098 = queryNorm
              0.39553517 = fieldWeight in 268, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.09375 = fieldNorm(doc=268)
          0.057056673 = weight(abstract_txt:improve in 268) [ClassicSimilarity], result of:
            0.057056673 = score(doc=268,freq=1.0), product of:
              0.12147316 = queryWeight, product of:
                1.2043049 = boost
                5.010197 = idf(docFreq=766, maxDocs=42306)
                0.020132098 = queryNorm
              0.469706 = fieldWeight in 268, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.010197 = idf(docFreq=766, maxDocs=42306)
                0.09375 = fieldNorm(doc=268)
          0.04791604 = weight(abstract_txt:user in 268) [ClassicSimilarity], result of:
            0.04791604 = score(doc=268,freq=2.0), product of:
              0.09823896 = queryWeight, product of:
                1.3264284 = boost
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.020132098 = queryNorm
              0.48774987 = fieldWeight in 268, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.09375 = fieldNorm(doc=268)
          0.548173 = weight(abstract_txt:filtering in 268) [ClassicSimilarity], result of:
            0.548173 = score(doc=268,freq=3.0), product of:
              0.5166075 = queryWeight, product of:
                3.926871 = boost
                6.5346904 = idf(docFreq=166, maxDocs=42306)
                0.020132098 = queryNorm
              1.0611014 = fieldWeight in 268, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5346904 = idf(docFreq=166, maxDocs=42306)
                0.09375 = fieldNorm(doc=268)
        0.2 = coord(5/25)
    
  5. Orrico, E.G.D.: Metaphorical representations of the thematic identity of social Groups in the assistance of information retrieval (2003) 0.13
    0.13033755 = sum of:
      0.13033755 = product of:
        0.5430731 = sum of:
          0.012407168 = weight(abstract_txt:from in 3777) [ClassicSimilarity], result of:
            0.012407168 = score(doc=3777,freq=1.0), product of:
              0.05678179 = queryWeight, product of:
                1.0084317 = boost
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.020132098 = queryNorm
              0.2185061 = fieldWeight in 3777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.078125 = fieldNorm(doc=3777)
          0.018928273 = weight(abstract_txt:information in 3777) [ClassicSimilarity], result of:
            0.018928273 = score(doc=3777,freq=3.0), product of:
              0.05742581 = queryWeight, product of:
                1.1710215 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.020132098 = queryNorm
              0.32961264 = fieldWeight in 3777, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.078125 = fieldNorm(doc=3777)
          0.047547225 = weight(abstract_txt:improve in 3777) [ClassicSimilarity], result of:
            0.047547225 = score(doc=3777,freq=1.0), product of:
              0.12147316 = queryWeight, product of:
                1.2043049 = boost
                5.010197 = idf(docFreq=766, maxDocs=42306)
                0.020132098 = queryNorm
              0.39142165 = fieldWeight in 3777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.010197 = idf(docFreq=766, maxDocs=42306)
                0.078125 = fieldNorm(doc=3777)
          0.062971175 = weight(abstract_txt:difficult in 3777) [ClassicSimilarity], result of:
            0.062971175 = score(doc=3777,freq=1.0), product of:
              0.14649566 = queryWeight, product of:
                1.3225396 = boost
                5.5020814 = idf(docFreq=468, maxDocs=42306)
                0.020132098 = queryNorm
              0.4298501 = fieldWeight in 3777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5020814 = idf(docFreq=468, maxDocs=42306)
                0.078125 = fieldNorm(doc=3777)
          0.028234797 = weight(abstract_txt:user in 3777) [ClassicSimilarity], result of:
            0.028234797 = score(doc=3777,freq=1.0), product of:
              0.09823896 = queryWeight, product of:
                1.3264284 = boost
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.020132098 = queryNorm
              0.28740937 = fieldWeight in 3777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.078125 = fieldNorm(doc=3777)
          0.37298447 = weight(abstract_txt:filtering in 3777) [ClassicSimilarity], result of:
            0.37298447 = score(doc=3777,freq=2.0), product of:
              0.5166075 = queryWeight, product of:
                3.926871 = boost
                6.5346904 = idf(docFreq=166, maxDocs=42306)
                0.020132098 = queryNorm
              0.7219881 = fieldWeight in 3777, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5346904 = idf(docFreq=166, maxDocs=42306)
                0.078125 = fieldNorm(doc=3777)
        0.24 = coord(6/25)
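
Reading the score breakdowns

Each breakdown above follows the same pattern: a per-term score is the product of a queryWeight (boost x idf x queryNorm) and a fieldWeight (tf x idf x fieldNorm), the term scores are summed, and the sum is multiplied by coord (matched query terms / total query terms). The Python sketch below recomputes the score of entry 1 (doc 1223) from the values printed in its breakdown. It assumes the Lucene ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the figures shown. The function and constant names are illustrative only and are not part of any library.

```python
import math

# Values copied from the breakdown of entry 1 (doc 1223).
MAX_DOCS = 42306
QUERY_NORM = 0.020132098

# (term, termFreq, docFreq, boost) for each matching abstract_txt term.
TERMS_DOC_1223 = [
    ("information", 2.0, 10064, 1.1710215),
    ("improve", 1.0, 766, 1.2043049),
    ("relevant", 3.0, 1085, 1.681067),
    ("list", 1.0, 525, 2.5899386),
    ("filtering", 3.0, 166, 3.926871),
]


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency (assumed ClassicSimilarity form)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm,
               max_docs=MAX_DOCS, query_norm=QUERY_NORM):
    """Score of one term: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)                       # tf(freq) line in the breakdown
    term_idf = idf(doc_freq, max_docs)         # idf(docFreq, maxDocs) line
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm, matched, total):
    """Sum of the term scores, scaled by coord = matched / total."""
    coord = matched / total
    return coord * sum(
        term_score(freq, doc_freq, boost, field_norm)
        for _term, freq, doc_freq, boost in terms
    )


if __name__ == "__main__":
    score = document_score(TERMS_DOC_1223, field_norm=0.09375,
                           matched=5, total=25)
    print(f"{score:.4f}")  # prints 0.1770
```

The sketch uses double-precision arithmetic, so the last digits can differ slightly from the single-precision values in the breakdowns. The same functions apply to the other entries once their termFreq, fieldNorm and coord values are substituted.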