Document (#31972)

Author
Rui, Y.
Ortega, M.
Huang, T.S.
Mehrotra, S.
Title
Information retrieval beyond the text document
Source
Library trends. 48(1999) no.2, pp.455-474
Year
1999
Abstract
With the expansion of the Internet, searching for information goes beyond the boundary of physical libraries. Millions of documents of various media types (such as text, image, video, audio, graphics, and animation) are available around the world and linked by the Internet. Unfortunately, the state of the art of search engines for media types other than text lags far behind that of their text counterparts. To address this situation, we have developed the Multimedia Analysis and Retrieval System (MARS). This article reports some of the progress made over the years toward exploring information retrieval beyond the text domain. In particular, the following aspects of MARS are addressed in the article: visual feature extraction, retrieval models, query reformulation techniques, efficient execution speed performance, and user interface considerations. Extensive experimental results are reported to validate the proposed approaches.
Form
Images

Similar documents (author)

  1. Ortega, C.D.: Conceptual and procedural grounding of documentary systems (2012) 2.34
    2.3378863 = sum of:
      2.3378863 = product of:
        4.6757727 = sum of:
          4.6757727 = weight(author_txt:ortega in 143) [ClassicSimilarity], result of:
            4.6757727 = score(doc=143,freq=1.0), product of:
              0.8382997 = queryWeight, product of:
                1.2399892 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.07575431 = queryNorm
              5.5776863 = fieldWeight in 143, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.625 = fieldNorm(doc=143)
        0.5 = coord(1/2)
    
  2. Ortega, J.L.: The presence of academic journals on Twitter and its relationship with dissemination (tweets) and research impact (citations) (2017) 2.34
    2.3378863 = sum of:
      2.3378863 = product of:
        4.6757727 = sum of:
          4.6757727 = weight(author_txt:ortega in 4410) [ClassicSimilarity], result of:
            4.6757727 = score(doc=4410,freq=1.0), product of:
              0.8382997 = queryWeight, product of:
                1.2399892 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.07575431 = queryNorm
              5.5776863 = fieldWeight in 4410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.625 = fieldNorm(doc=4410)
        0.5 = coord(1/2)
    
  3. Ortega, J.L.: Classification and analysis of PubPeer comments : how a web journal club is used (2022) 2.34
    2.3378863 = sum of:
      2.3378863 = product of:
        4.6757727 = sum of:
          4.6757727 = weight(author_txt:ortega in 544) [ClassicSimilarity], result of:
            4.6757727 = score(doc=544,freq=1.0), product of:
              0.8382997 = queryWeight, product of:
                1.2399892 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.07575431 = queryNorm
              5.5776863 = fieldWeight in 544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.625 = fieldNorm(doc=544)
        0.5 = coord(1/2)
    
  4. Ortega, C. Dotta => Cristina Dotta Ortega, C.D.: 1.98
    1.9837624 = sum of:
      1.9837624 = product of:
        3.9675248 = sum of:
          3.9675248 = weight(author_txt:ortega in 4706) [ClassicSimilarity], result of:
            3.9675248 = score(doc=4706,freq=2.0), product of:
              0.8382997 = queryWeight, product of:
                1.2399892 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.07575431 = queryNorm
              4.732824 = fieldWeight in 4706, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.375 = fieldNorm(doc=4706)
        0.5 = coord(1/2)
    
  5. Ortega, J.L.; Aguillo, I.F.: Visualization of the Nordic academic web : link analysis using social network tools (2008) 1.87
    1.8703091 = sum of:
      1.8703091 = product of:
        3.7406182 = sum of:
          3.7406182 = weight(author_txt:ortega in 2114) [ClassicSimilarity], result of:
            3.7406182 = score(doc=2114,freq=1.0), product of:
              0.8382997 = queryWeight, product of:
                1.2399892 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.07575431 = queryNorm
              4.462149 = fieldWeight in 2114, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.5 = fieldNorm(doc=2114)
        0.5 = coord(1/2)
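
The author-match scores above can be reproduced by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity definitions (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function names `idf` and `term_score` are illustrative and not part of any library. The boost, queryNorm, fieldNorm and coord values are copied from the explanations for hits 1 and 4.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    """ClassicSimilarity inverse document frequency."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, max_docs: int,
               boost: float, query_norm: float, field_norm: float) -> float:
    """Score of one term clause: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm           # "queryWeight" line
    field_weight = math.sqrt(freq) * term_idf * field_norm  # "fieldWeight" line
    return query_weight * field_weight


# Values shared by every author_txt:ortega explanation above.
common = dict(doc_freq=15, max_docs=44218,
              boost=1.2399892, query_norm=0.07575431)

# Hit 1 (doc 143): freq=1.0, fieldNorm=0.625, coord(1/2).
hit1 = 0.5 * term_score(freq=1.0, field_norm=0.625, **common)

# Hit 4 (doc 4706): freq=2.0, fieldNorm=0.375, coord(1/2).
hit4 = 0.5 * term_score(freq=2.0, field_norm=0.375, **common)

print(f"idf  = {idf(15, 44218):.6f}")
print(f"hit1 = {hit1:.6f}")
print(f"hit4 = {hit4:.6f}")
```

The printed values should agree with the listed 8.924298, 2.3378863 and 1.9837624 up to rounding, since Lucene computes in 32-bit floats while this sketch uses 64-bit floats. Hit 4 scores lower than hits 1 to 3 despite the higher term frequency because its smaller fieldNorm (0.375 versus 0.625) outweighs the sqrt(2) gain in tf.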
    

Similar documents (content)

  1. Huang, T.; Mehrotra, S.; Ramchandran, K.: Multimedia Access and Retrieval System (MARS) project (1997) 0.25
    0.25218058 = sum of:
      0.25218058 = product of:
        2.1015048 = sum of:
          0.020483602 = weight(abstract_txt:information in 758) [ClassicSimilarity], result of:
            0.020483602 = score(doc=758,freq=2.0), product of:
              0.054700095 = queryWeight, product of:
                1.1730233 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.019261774 = queryNorm
              0.37447104 = fieldWeight in 758, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.109375 = fieldNorm(doc=758)
          0.08078045 = weight(abstract_txt:retrieval in 758) [ClassicSimilarity], result of:
            0.08078045 = score(doc=758,freq=2.0), product of:
              0.15027992 = queryWeight, product of:
                2.2450833 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019261774 = queryNorm
              0.53753316 = fieldWeight in 758, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.109375 = fieldNorm(doc=758)
          2.0002408 = weight(title_txt:mars in 758) [ClassicSimilarity], result of:
            2.0002408 = score(doc=758,freq=1.0), product of:
              0.5614911 = queryWeight, product of:
                3.0685866 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.019261774 = queryNorm
              3.5623734 = fieldWeight in 758, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.375 = fieldNorm(doc=758)
        0.12 = coord(3/25)
    
  2. Kowalski, G.J.; Maybury, M.T.: Information storage and retrieval systems : theory and implementation (2000) 0.14
    0.13793314 = sum of:
      0.13793314 = product of:
        0.49261832 = sum of:
          0.08692017 = weight(abstract_txt:audio in 6727) [ClassicSimilarity], result of:
            0.08692017 = score(doc=6727,freq=1.0), product of:
              0.13880284 = queryWeight, product of:
                1.0788254 = boost
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.019261774 = queryNorm
              0.6262132 = fieldWeight in 6727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.09375 = fieldNorm(doc=6727)
          0.10027496 = weight(abstract_txt:graphics in 6727) [ClassicSimilarity], result of:
            0.10027496 = score(doc=6727,freq=1.0), product of:
              0.15267912 = queryWeight, product of:
                1.1314667 = boost
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.019261774 = queryNorm
              0.65676934 = fieldWeight in 6727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.09375 = fieldNorm(doc=6727)
          0.017557373 = weight(abstract_txt:information in 6727) [ClassicSimilarity], result of:
            0.017557373 = score(doc=6727,freq=2.0), product of:
              0.054700095 = queryWeight, product of:
                1.1730233 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.019261774 = queryNorm
              0.32097518 = fieldWeight in 6727, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=6727)
          0.030202307 = weight(abstract_txt:internet in 6727) [ClassicSimilarity], result of:
            0.030202307 = score(doc=6727,freq=1.0), product of:
              0.086434685 = queryWeight, product of:
                1.2039571 = boost
                3.7271836 = idf(docFreq=2891, maxDocs=44218)
                0.019261774 = queryNorm
              0.34942347 = fieldWeight in 6727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7271836 = idf(docFreq=2891, maxDocs=44218)
                0.09375 = fieldNorm(doc=6727)
          0.052045442 = weight(abstract_txt:types in 6727) [ClassicSimilarity], result of:
            0.052045442 = score(doc=6727,freq=1.0), product of:
              0.124236666 = queryWeight, product of:
                1.4434172 = boost
                4.4684987 = idf(docFreq=1377, maxDocs=44218)
                0.019261774 = queryNorm
              0.41892177 = fieldWeight in 6727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4684987 = idf(docFreq=1377, maxDocs=44218)
                0.09375 = fieldNorm(doc=6727)
          0.06924038 = weight(abstract_txt:retrieval in 6727) [ClassicSimilarity], result of:
            0.06924038 = score(doc=6727,freq=2.0), product of:
              0.15027992 = queryWeight, product of:
                2.2450833 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019261774 = queryNorm
              0.4607427 = fieldWeight in 6727, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=6727)
          0.13637768 = weight(abstract_txt:text in 6727) [ClassicSimilarity], result of:
            0.13637768 = score(doc=6727,freq=2.0), product of:
              0.25436667 = queryWeight, product of:
                3.2656307 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019261774 = queryNorm
              0.53614604 = fieldWeight in 6727, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=6727)
        0.28 = coord(7/25)
    
  3. Next generation search engines : advanced models for information retrieval (2012) 0.12
    0.11666023 = sum of:
      0.11666023 = product of:
        0.36456323 = sum of:
          0.036216736 = weight(abstract_txt:audio in 357) [ClassicSimilarity], result of:
            0.036216736 = score(doc=357,freq=1.0), product of:
              0.13880284 = queryWeight, product of:
                1.0788254 = boost
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.019261774 = queryNorm
              0.26092216 = fieldWeight in 357, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.0390625 = fieldNorm(doc=357)
          0.042113457 = weight(abstract_txt:goes in 357) [ClassicSimilarity], result of:
            0.042113457 = score(doc=357,freq=1.0), product of:
              0.1534874 = queryWeight, product of:
                1.1344578 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.019261774 = queryNorm
              0.2743773 = fieldWeight in 357, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.0390625 = fieldNorm(doc=357)
          0.022548107 = weight(abstract_txt:information in 357) [ClassicSimilarity], result of:
            0.022548107 = score(doc=357,freq=19.0), product of:
              0.054700095 = queryWeight, product of:
                1.1730233 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.019261774 = queryNorm
              0.41221333 = fieldWeight in 357, product of:
                4.358899 = tf(freq=19.0), with freq of:
                  19.0 = termFreq=19.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0390625 = fieldNorm(doc=357)
          0.01779688 = weight(abstract_txt:internet in 357) [ClassicSimilarity], result of:
            0.01779688 = score(doc=357,freq=2.0), product of:
              0.086434685 = queryWeight, product of:
                1.2039571 = boost
                3.7271836 = idf(docFreq=2891, maxDocs=44218)
                0.019261774 = queryNorm
              0.20589975 = fieldWeight in 357, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7271836 = idf(docFreq=2891, maxDocs=44218)
                0.0390625 = fieldNorm(doc=357)
          0.0216856 = weight(abstract_txt:types in 357) [ClassicSimilarity], result of:
            0.0216856 = score(doc=357,freq=1.0), product of:
              0.124236666 = queryWeight, product of:
                1.4434172 = boost
                4.4684987 = idf(docFreq=1377, maxDocs=44218)
                0.019261774 = queryNorm
              0.17455073 = fieldWeight in 357, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4684987 = idf(docFreq=1377, maxDocs=44218)
                0.0390625 = fieldNorm(doc=357)
          0.04561611 = weight(abstract_txt:retrieval in 357) [ClassicSimilarity], result of:
            0.04561611 = score(doc=357,freq=5.0), product of:
              0.15027992 = queryWeight, product of:
                2.2450833 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019261774 = queryNorm
              0.30354095 = fieldWeight in 357, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0390625 = fieldNorm(doc=357)
          0.09822505 = weight(abstract_txt:beyond in 357) [ClassicSimilarity], result of:
            0.09822505 = score(doc=357,freq=2.0), product of:
              0.30900872 = queryWeight, product of:
                2.7880335 = boost
                5.754088 = idf(docFreq=380, maxDocs=44218)
                0.019261774 = queryNorm
              0.31787145 = fieldWeight in 357, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.754088 = idf(docFreq=380, maxDocs=44218)
                0.0390625 = fieldNorm(doc=357)
          0.08036132 = weight(abstract_txt:text in 357) [ClassicSimilarity], result of:
            0.08036132 = score(doc=357,freq=4.0), product of:
              0.25436667 = queryWeight, product of:
                3.2656307 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019261774 = queryNorm
              0.3159271 = fieldWeight in 357, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0390625 = fieldNorm(doc=357)
        0.32 = coord(8/25)
    
  4. Hearst, M.A.: Search user interfaces (2009) 0.12
    0.116339475 = sum of:
      0.116339475 = product of:
        0.48474783 = sum of:
          0.078448534 = weight(abstract_txt:behind in 4029) [ClassicSimilarity], result of:
            0.078448534 = score(doc=4029,freq=1.0), product of:
              0.1296307 = queryWeight, product of:
                1.0425717 = boost
                6.45514 = idf(docFreq=188, maxDocs=44218)
                0.019261774 = queryNorm
              0.6051694 = fieldWeight in 4029, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.45514 = idf(docFreq=188, maxDocs=44218)
                0.09375 = fieldNorm(doc=4029)
          0.0804472 = weight(abstract_txt:considerations in 4029) [ClassicSimilarity], result of:
            0.0804472 = score(doc=4029,freq=1.0), product of:
              0.13182323 = queryWeight, product of:
                1.0513515 = boost
                6.5095015 = idf(docFreq=178, maxDocs=44218)
                0.019261774 = queryNorm
              0.61026573 = fieldWeight in 4029, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5095015 = idf(docFreq=178, maxDocs=44218)
                0.09375 = fieldNorm(doc=4029)
          0.021503301 = weight(abstract_txt:information in 4029) [ClassicSimilarity], result of:
            0.021503301 = score(doc=4029,freq=3.0), product of:
              0.054700095 = queryWeight, product of:
                1.1730233 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.019261774 = queryNorm
              0.3931127 = fieldWeight in 4029, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=4029)
          0.13867486 = weight(abstract_txt:reformulation in 4029) [ClassicSimilarity], result of:
            0.13867486 = score(doc=4029,freq=1.0), product of:
              0.1895177 = queryWeight, product of:
                1.2605988 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.019261774 = queryNorm
              0.73172504 = fieldWeight in 4029, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.09375 = fieldNorm(doc=4029)
          0.06924038 = weight(abstract_txt:retrieval in 4029) [ClassicSimilarity], result of:
            0.06924038 = score(doc=4029,freq=2.0), product of:
              0.15027992 = queryWeight, product of:
                2.2450833 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019261774 = queryNorm
              0.4607427 = fieldWeight in 4029, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=4029)
          0.09643358 = weight(abstract_txt:text in 4029) [ClassicSimilarity], result of:
            0.09643358 = score(doc=4029,freq=1.0), product of:
              0.25436667 = queryWeight, product of:
                3.2656307 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019261774 = queryNorm
              0.37911248 = fieldWeight in 4029, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=4029)
        0.24 = coord(6/25)
    
  5. Barrio, P.; Gravano, L.: Sampling strategies for information extraction over the deep web (2017) 0.09
    0.0930056 = sum of:
      0.0930056 = product of:
        0.465028 = sum of:
          0.12114491 = weight(abstract_txt:extraction in 3412) [ClassicSimilarity], result of:
            0.12114491 = score(doc=3412,freq=9.0), product of:
              0.11926034 = queryWeight, product of:
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.019261774 = queryNorm
              1.0158021 = fieldWeight in 3412, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3412)
          0.020483602 = weight(abstract_txt:information in 3412) [ClassicSimilarity], result of:
            0.020483602 = score(doc=3412,freq=8.0), product of:
              0.054700095 = queryWeight, product of:
                1.1730233 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.019261774 = queryNorm
              0.37447104 = fieldWeight in 3412, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3412)
          0.13417801 = weight(abstract_txt:execution in 3412) [ClassicSimilarity], result of:
            0.13417801 = score(doc=3412,freq=2.0), product of:
              0.21077433 = queryWeight, product of:
                1.3294158 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.019261774 = queryNorm
              0.6365956 = fieldWeight in 3412, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3412)
          0.040390223 = weight(abstract_txt:retrieval in 3412) [ClassicSimilarity], result of:
            0.040390223 = score(doc=3412,freq=2.0), product of:
              0.15027992 = queryWeight, product of:
                2.2450833 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019261774 = queryNorm
              0.26876658 = fieldWeight in 3412, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3412)
          0.14883123 = weight(abstract_txt:text in 3412) [ClassicSimilarity], result of:
            0.14883123 = score(doc=3412,freq=7.0), product of:
              0.25436667 = queryWeight, product of:
                3.2656307 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019261774 = queryNorm
              0.5851051 = fieldWeight in 3412, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3412)
        0.2 = coord(5/25)
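
For the content-based matches, several term clauses are summed and then scaled by the coordination factor. The following is a minimal sketch for the first hit in this list (doc 758), assuming the standard Lucene ClassicSimilarity definitions (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), coord = matched clauses / total clauses); the names `clause_score` and `CLAUSES` are illustrative only. All numeric inputs are copied from the explanation for that hit.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.019261774  # shared by all clauses of the content query


def clause_score(freq: float, doc_freq: int, boost: float,
                 field_norm: float) -> float:
    """queryWeight * fieldWeight for a single term clause."""
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


# (term, freq, docFreq, boost, fieldNorm) for doc 758.
CLAUSES = [
    ("abstract_txt:information", 2.0, 10677, 1.1730233, 0.109375),
    ("abstract_txt:retrieval",   2.0, 3720,  2.2450833, 0.109375),
    ("title_txt:mars",           1.0, 8,     3.0685866, 0.375),
]

scores = {term: clause_score(freq, df, boost, norm)
          for term, freq, df, boost, norm in CLAUSES}

# 3 of the 25 query clauses matched, hence coord(3/25) = 0.12.
coord = len(CLAUSES) / 25
total = coord * sum(scores.values())

for term, value in scores.items():
    print(f"{term:26s} {value:.6f}")
print(f"total = {total:.6f}")
```

The total should agree with the listed 0.25218058 up to float rounding. Nearly all of it comes from the rare title term "mars" (docFreq=8), which shows why this hit ranks first even though only 3 of 25 clauses match.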