Document (#31970)

Author
Rui, Y.
Ortega, M.
Huang, T.S.
Mehrotra, S.
Title
Information retrieval beyond the text document
Source
Library trends. 48(1999) no.2, S.455-474
Year
1999
Abstract
With the expansion of the Internet, searching for information goes beyond the boundary of physical libraries. Millions of documents of various media types-such as text, image, video, audio, graphics, and animation-are available around the world and linked by the Internet. Unfortunately, the state of the art of search engines for media types other than text lags far behind their text counterparts. To address this situation, we have developed the Multimedia Analysis and Retrieval System (MARS). This article reports some of the progress made over the years toward exploring information retrieval beyond the text domain. In particular, the following aspects of MARS are addressed in the article: visual feature extraction, retrieval models, query reformulation techniques, efficient execution speed performance, and user interface considerations. Extensive experimental results are reported to validate the proposed approaches.
Form
Bilder

Similar documents (author)

  1. Ortega, C.D.: Conceptual and procedural grounding of documentary systems (2012) 2.35
    2.3526764 = sum of:
      2.3526764 = product of:
        4.705353 = sum of:
          4.705353 = weight(author_txt:ortega in 2141) [ClassicSimilarity], result of:
            4.705353 = score(doc=2141,freq=1.0), product of:
              0.8389538 = queryWeight, product of:
                1.24162 = boost
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.07529658 = queryNorm
              5.608596 = fieldWeight in 2141, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.625 = fieldNorm(doc=2141)
        0.5 = coord(1/2)
    
  2. Ortega, J.L.: ¬The presence of academic journals on Twitter and its relationship with dissemination (tweets) and research impact (citations) (2017) 2.35
    2.3526764 = sum of:
      2.3526764 = product of:
        4.705353 = sum of:
          4.705353 = weight(author_txt:ortega in 696) [ClassicSimilarity], result of:
            4.705353 = score(doc=696,freq=1.0), product of:
              0.8389538 = queryWeight, product of:
                1.24162 = boost
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.07529658 = queryNorm
              5.608596 = fieldWeight in 696, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.625 = fieldNorm(doc=696)
        0.5 = coord(1/2)
    
  3. Ortega, J.L.: Classification and analysis of PubPeer comments : how a web journal club is used (2022) 2.35
    2.3526764 = sum of:
      2.3526764 = product of:
        4.705353 = sum of:
          4.705353 = weight(author_txt:ortega in 2831) [ClassicSimilarity], result of:
            4.705353 = score(doc=2831,freq=1.0), product of:
              0.8389538 = queryWeight, product of:
                1.24162 = boost
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.07529658 = queryNorm
              5.608596 = fieldWeight in 2831, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.625 = fieldNorm(doc=2831)
        0.5 = coord(1/2)
    
  4. Ortega, C. Dotta => Cristina Dotta Ortega, C.D.: 2.00
    1.9963119 = sum of:
      1.9963119 = product of:
        3.9926238 = sum of:
          3.9926238 = weight(author_txt:ortega in 992) [ClassicSimilarity], result of:
            3.9926238 = score(doc=992,freq=2.0), product of:
              0.8389538 = queryWeight, product of:
                1.24162 = boost
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.07529658 = queryNorm
              4.759051 = fieldWeight in 992, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.375 = fieldNorm(doc=992)
        0.5 = coord(1/2)
    
  5. Ortega, J.L.; Aguillo, I.F.: Visualization of the Nordic academic web : link analysis using social network tools (2008) 1.88
    1.882141 = sum of:
      1.882141 = product of:
        3.764282 = sum of:
          3.764282 = weight(author_txt:ortega in 4112) [ClassicSimilarity], result of:
            3.764282 = score(doc=4112,freq=1.0), product of:
              0.8389538 = queryWeight, product of:
                1.24162 = boost
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.07529658 = queryNorm
              4.4868765 = fieldWeight in 4112, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.5 = fieldNorm(doc=4112)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Huang, T.; Mehrotra, S.; Ramchandran, K.: Multimedia Access and Retrieval System (MARS) project (1997) 0.25
    0.2503483 = sum of:
      0.2503483 = product of:
        2.0862358 = sum of:
          0.020536102 = weight(abstract_txt:information in 756) [ClassicSimilarity], result of:
            0.020536102 = score(doc=756,freq=2.0), product of:
              0.054743055 = queryWeight, product of:
                1.1732863 = boost
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.019238405 = queryNorm
              0.37513623 = fieldWeight in 756, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.109375 = fieldNorm(doc=756)
          0.080468185 = weight(abstract_txt:retrieval in 756) [ClassicSimilarity], result of:
            0.080468185 = score(doc=756,freq=2.0), product of:
              0.14975433 = queryWeight, product of:
                2.2407765 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.019238405 = queryNorm
              0.5373346 = fieldWeight in 756, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.109375 = fieldNorm(doc=756)
          1.9852315 = weight(title_txt:mars in 756) [ClassicSimilarity], result of:
            1.9852315 = score(doc=756,freq=1.0), product of:
              0.55816406 = queryWeight, product of:
                3.058967 = boost
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.019238405 = queryNorm
              3.556717 = fieldWeight in 756, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.375 = fieldNorm(doc=756)
        0.12 = coord(3/25)
    
  2. Kowalski, G.J.; Maybury, M.T.: Information storage and retrieval systems : theory and implemetation (2000) 0.14
    0.13765417 = sum of:
      0.13765417 = product of:
        0.49162203 = sum of:
          0.08661206 = weight(abstract_txt:audio in 725) [ClassicSimilarity], result of:
            0.08661206 = score(doc=725,freq=1.0), product of:
              0.13834709 = queryWeight, product of:
                1.0768715 = boost
                6.6778564 = idf(docFreq=148, maxDocs=43556)
                0.019238405 = queryNorm
              0.62604904 = fieldWeight in 725, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6778564 = idf(docFreq=148, maxDocs=43556)
                0.09375 = fieldNorm(doc=725)
          0.09974706 = weight(abstract_txt:graphics in 725) [ClassicSimilarity], result of:
            0.09974706 = score(doc=725,freq=1.0), product of:
              0.15200266 = queryWeight, product of:
                1.1287674 = boost
                6.9996715 = idf(docFreq=107, maxDocs=43556)
                0.019238405 = queryNorm
              0.6562192 = fieldWeight in 725, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9996715 = idf(docFreq=107, maxDocs=43556)
                0.09375 = fieldNorm(doc=725)
          0.017602375 = weight(abstract_txt:information in 725) [ClassicSimilarity], result of:
            0.017602375 = score(doc=725,freq=2.0), product of:
              0.054743055 = queryWeight, product of:
                1.1732863 = boost
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.019238405 = queryNorm
              0.32154533 = fieldWeight in 725, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.09375 = fieldNorm(doc=725)
          0.029879944 = weight(abstract_txt:internet in 725) [ClassicSimilarity], result of:
            0.029879944 = score(doc=725,freq=1.0), product of:
              0.0857395 = queryWeight, product of:
                1.1989038 = boost
                3.7172995 = idf(docFreq=2876, maxDocs=43556)
                0.019238405 = queryNorm
              0.34849682 = fieldWeight in 725, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7172995 = idf(docFreq=2876, maxDocs=43556)
                0.09375 = fieldNorm(doc=725)
          0.052247442 = weight(abstract_txt:types in 725) [ClassicSimilarity], result of:
            0.052247442 = score(doc=725,freq=1.0), product of:
              0.12444319 = queryWeight, product of:
                1.4443733 = boost
                4.4783974 = idf(docFreq=1343, maxDocs=43556)
                0.019238405 = queryNorm
              0.41984975 = fieldWeight in 725, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4783974 = idf(docFreq=1343, maxDocs=43556)
                0.09375 = fieldNorm(doc=725)
          0.06897273 = weight(abstract_txt:retrieval in 725) [ClassicSimilarity], result of:
            0.06897273 = score(doc=725,freq=2.0), product of:
              0.14975433 = queryWeight, product of:
                2.2407765 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.019238405 = queryNorm
              0.46057254 = fieldWeight in 725, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.09375 = fieldNorm(doc=725)
          0.1365604 = weight(abstract_txt:text in 725) [ClassicSimilarity], result of:
            0.1365604 = score(doc=725,freq=2.0), product of:
              0.2543593 = queryWeight, product of:
                3.2650337 = boost
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.019238405 = queryNorm
              0.5368799 = fieldWeight in 725, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.09375 = fieldNorm(doc=725)
        0.28 = coord(7/25)
    
  3. Next generation search engines : advanced models for information retrieval (2012) 0.12
    0.117229365 = sum of:
      0.117229365 = product of:
        0.36634177 = sum of:
          0.03608836 = weight(abstract_txt:audio in 2355) [ClassicSimilarity], result of:
            0.03608836 = score(doc=2355,freq=1.0), product of:
              0.13834709 = queryWeight, product of:
                1.0768715 = boost
                6.6778564 = idf(docFreq=148, maxDocs=43556)
                0.019238405 = queryNorm
              0.26085377 = fieldWeight in 2355, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6778564 = idf(docFreq=148, maxDocs=43556)
                0.0390625 = fieldNorm(doc=2355)
          0.04241138 = weight(abstract_txt:goes in 2355) [ClassicSimilarity], result of:
            0.04241138 = score(doc=2355,freq=1.0), product of:
              0.15406838 = queryWeight, product of:
                1.1364115 = boost
                7.047074 = idf(docFreq=102, maxDocs=43556)
                0.019238405 = queryNorm
              0.27527633 = fieldWeight in 2355, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.047074 = idf(docFreq=102, maxDocs=43556)
                0.0390625 = fieldNorm(doc=2355)
          0.022605902 = weight(abstract_txt:information in 2355) [ClassicSimilarity], result of:
            0.022605902 = score(doc=2355,freq=19.0), product of:
              0.054743055 = queryWeight, product of:
                1.1732863 = boost
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.019238405 = queryNorm
              0.41294557 = fieldWeight in 2355, product of:
                4.358899 = tf(freq=19.0), with freq of:
                  19.0 = termFreq=19.0
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.0390625 = fieldNorm(doc=2355)
          0.017606925 = weight(abstract_txt:internet in 2355) [ClassicSimilarity], result of:
            0.017606925 = score(doc=2355,freq=2.0), product of:
              0.0857395 = queryWeight, product of:
                1.1989038 = boost
                3.7172995 = idf(docFreq=2876, maxDocs=43556)
                0.019238405 = queryNorm
              0.20535372 = fieldWeight in 2355, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7172995 = idf(docFreq=2876, maxDocs=43556)
                0.0390625 = fieldNorm(doc=2355)
          0.021769768 = weight(abstract_txt:types in 2355) [ClassicSimilarity], result of:
            0.021769768 = score(doc=2355,freq=1.0), product of:
              0.12444319 = queryWeight, product of:
                1.4443733 = boost
                4.4783974 = idf(docFreq=1343, maxDocs=43556)
                0.019238405 = queryNorm
              0.1749374 = fieldWeight in 2355, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4783974 = idf(docFreq=1343, maxDocs=43556)
                0.0390625 = fieldNorm(doc=2355)
          0.045439776 = weight(abstract_txt:retrieval in 2355) [ClassicSimilarity], result of:
            0.045439776 = score(doc=2355,freq=5.0), product of:
              0.14975433 = queryWeight, product of:
                2.2407765 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.019238405 = queryNorm
              0.3034288 = fieldWeight in 2355, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.0390625 = fieldNorm(doc=2355)
          0.099950686 = weight(abstract_txt:beyond in 2355) [ClassicSimilarity], result of:
            0.099950686 = score(doc=2355,freq=2.0), product of:
              0.3123294 = queryWeight, product of:
                2.8025022 = boost
                5.792925 = idf(docFreq=360, maxDocs=43556)
                0.019238405 = queryNorm
              0.3200169 = fieldWeight in 2355, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.792925 = idf(docFreq=360, maxDocs=43556)
                0.0390625 = fieldNorm(doc=2355)
          0.08046899 = weight(abstract_txt:text in 2355) [ClassicSimilarity], result of:
            0.08046899 = score(doc=2355,freq=4.0), product of:
              0.2543593 = queryWeight, product of:
                3.2650337 = boost
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.019238405 = queryNorm
              0.31635952 = fieldWeight in 2355, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.0390625 = fieldNorm(doc=2355)
        0.32 = coord(8/25)
    
  4. Hearst, M.A.: Search user interfaces (2009) 0.12
    0.11698265 = sum of:
      0.11698265 = product of:
        0.4874277 = sum of:
          0.079260476 = weight(abstract_txt:behind in 1027) [ClassicSimilarity], result of:
            0.079260476 = score(doc=1027,freq=1.0), product of:
              0.1304034 = queryWeight, product of:
                1.0454983 = boost
                6.483306 = idf(docFreq=180, maxDocs=43556)
                0.019238405 = queryNorm
              0.6078099 = fieldWeight in 1027, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.483306 = idf(docFreq=180, maxDocs=43556)
                0.09375 = fieldNorm(doc=1027)
          0.08136322 = weight(abstract_txt:considerations in 1027) [ClassicSimilarity], result of:
            0.08136322 = score(doc=1027,freq=1.0), product of:
              0.13269967 = queryWeight, product of:
                1.0546632 = boost
                6.540139 = idf(docFreq=170, maxDocs=43556)
                0.019238405 = queryNorm
              0.6131381 = fieldWeight in 1027, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.540139 = idf(docFreq=170, maxDocs=43556)
                0.09375 = fieldNorm(doc=1027)
          0.021558417 = weight(abstract_txt:information in 1027) [ClassicSimilarity], result of:
            0.021558417 = score(doc=1027,freq=3.0), product of:
              0.054743055 = queryWeight, product of:
                1.1732863 = boost
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.019238405 = queryNorm
              0.393811 = fieldWeight in 1027, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.09375 = fieldNorm(doc=1027)
          0.1397101 = weight(abstract_txt:reformulation in 1027) [ClassicSimilarity], result of:
            0.1397101 = score(doc=1027,freq=1.0), product of:
              0.19028431 = queryWeight, product of:
                1.2629331 = boost
                7.831655 = idf(docFreq=46, maxDocs=43556)
                0.019238405 = queryNorm
              0.73421764 = fieldWeight in 1027, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.831655 = idf(docFreq=46, maxDocs=43556)
                0.09375 = fieldNorm(doc=1027)
          0.06897273 = weight(abstract_txt:retrieval in 1027) [ClassicSimilarity], result of:
            0.06897273 = score(doc=1027,freq=2.0), product of:
              0.14975433 = queryWeight, product of:
                2.2407765 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.019238405 = queryNorm
              0.46057254 = fieldWeight in 1027, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.09375 = fieldNorm(doc=1027)
          0.09656278 = weight(abstract_txt:text in 1027) [ClassicSimilarity], result of:
            0.09656278 = score(doc=1027,freq=1.0), product of:
              0.2543593 = queryWeight, product of:
                3.2650337 = boost
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.019238405 = queryNorm
              0.3796314 = fieldWeight in 1027, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.09375 = fieldNorm(doc=1027)
        0.24 = coord(6/25)
    
  5. Barrio, P.; Gravano, L.: Sampling strategies for information extraction over the deep web (2017) 0.09
    0.0931593 = sum of:
      0.0931593 = product of:
        0.4657965 = sum of:
          0.12137376 = weight(abstract_txt:extraction in 410) [ClassicSimilarity], result of:
            0.12137376 = score(doc=410,freq=9.0), product of:
              0.1193005 = queryWeight, product of:
                6.201164 = idf(docFreq=239, maxDocs=43556)
                0.019238405 = queryNorm
              1.0173784 = fieldWeight in 410, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.201164 = idf(docFreq=239, maxDocs=43556)
                0.0546875 = fieldNorm(doc=410)
          0.020536102 = weight(abstract_txt:information in 410) [ClassicSimilarity], result of:
            0.020536102 = score(doc=410,freq=8.0), product of:
              0.054743055 = queryWeight, product of:
                1.1732863 = boost
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.019238405 = queryNorm
              0.37513623 = fieldWeight in 410, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.0546875 = fieldNorm(doc=410)
          0.1346219 = weight(abstract_txt:execution in 410) [ClassicSimilarity], result of:
            0.1346219 = score(doc=410,freq=2.0), product of:
              0.21104434 = queryWeight, product of:
                1.3300431 = boost
                8.247815 = idf(docFreq=30, maxDocs=43556)
                0.019238405 = queryNorm
              0.63788444 = fieldWeight in 410, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.247815 = idf(docFreq=30, maxDocs=43556)
                0.0546875 = fieldNorm(doc=410)
          0.040234093 = weight(abstract_txt:retrieval in 410) [ClassicSimilarity], result of:
            0.040234093 = score(doc=410,freq=2.0), product of:
              0.14975433 = queryWeight, product of:
                2.2407765 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.019238405 = queryNorm
              0.2686673 = fieldWeight in 410, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.0546875 = fieldNorm(doc=410)
          0.14903066 = weight(abstract_txt:text in 410) [ClassicSimilarity], result of:
            0.14903066 = score(doc=410,freq=7.0), product of:
              0.2543593 = queryWeight, product of:
                3.2650337 = boost
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.019238405 = queryNorm
              0.585906 = fieldWeight in 410, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.0546875 = fieldNorm(doc=410)
        0.2 = coord(5/25)