Document (#17765)

Author
Srihari, R.K.
Title
Using speech input for image interpretation, annotation, and retrieval
Source
Digital image access and retrieval: Proceedings of the 1996 Clinic on Library Applications of Data Processing, 24-26 Mar 1996. Ed.: P.B. Heidorn and B. Sandore
Imprint
Urbana-Champaign, IL : University of Illinois at Urbana-Champaign, Department of Library and Information Science
Year
1997
Pages
pp. 140-156
Abstract
Explores the interaction of textual and photographic information in an integrated text and image database environment and describes three applications that exploit linguistic context in vision. Describes the practical application of these ideas in working systems. PICTION uses captions to identify human faces in a photograph, while Show&Tell is a multimedia system for semi-automatic image annotation. The system combines advances in speech recognition, natural language processing, and image understanding to assist in image annotation and enhance image retrieval capabilities. Presents an extension of this work to video annotation and retrieval.
Theme
Speech retrieval
Form
Images
Object
PICTION
Show&Tell

Similar documents (content)

  1. Chen, J.; Wang, D.; Xie, I.; Lu, Q.: Image annotation tactics : transitions, strategies and efficiency (2018) 0.20
    0.19815929 = sum of:
      0.19815929 = product of:
        1.2384956 = sum of:
          0.045540273 = weight(abstract_txt:interpretation in 5046) [ClassicSimilarity], result of:
            0.045540273 = score(doc=5046,freq=2.0), product of:
              0.09884577 = queryWeight, product of:
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.016592951 = queryNorm
              0.4607205 = fieldWeight in 5046, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5046)
          0.03587386 = weight(abstract_txt:involving in 5046) [ClassicSimilarity], result of:
            0.03587386 = score(doc=5046,freq=1.0), product of:
              0.106224105 = queryWeight, product of:
                1.0366508 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.016592951 = queryNorm
              0.33771864 = fieldWeight in 5046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5046)
          0.7314627 = weight(abstract_txt:annotation in 5046) [ClassicSimilarity], result of:
            0.7314627 = score(doc=5046,freq=12.0), product of:
              0.54969954 = queryWeight, product of:
                4.7164326 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.016592951 = queryNorm
              1.3306592 = fieldWeight in 5046, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5046)
          0.4256188 = weight(abstract_txt:image in 5046) [ClassicSimilarity], result of:
            0.4256188 = score(doc=5046,freq=9.0), product of:
              0.48271167 = queryWeight, product of:
                5.4130306 = boost
                5.374322 = idf(docFreq=556, maxDocs=44218)
                0.016592951 = queryNorm
              0.8817247 = fieldWeight in 5046, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.374322 = idf(docFreq=556, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5046)
        0.16 = coord(4/25)
    
  2. ISLIP Media introduces a system for searching digital video and audio libraries (1998) 0.19
    0.18617141 = sum of:
      0.18617141 = product of:
        0.7757142 = sum of:
          0.07984601 = weight(abstract_txt:recognition in 1513) [ClassicSimilarity], result of:
            0.07984601 = score(doc=1513,freq=1.0), product of:
              0.10435787 = queryWeight, product of:
                1.0275041 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.016592951 = queryNorm
              0.7651173 = fieldWeight in 1513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.125 = fieldNorm(doc=1513)
          0.08029408 = weight(abstract_txt:video in 1513) [ClassicSimilarity], result of:
            0.08029408 = score(doc=1513,freq=1.0), product of:
              0.10474792 = queryWeight, product of:
                1.0294225 = boost
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.016592951 = queryNorm
              0.76654583 = fieldWeight in 1513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.125 = fieldNorm(doc=1513)
          0.026706135 = weight(abstract_txt:system in 1513) [ClassicSimilarity], result of:
            0.026706135 = score(doc=1513,freq=1.0), product of:
              0.063353956 = queryWeight, product of:
                1.1321992 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.016592951 = queryNorm
              0.42153856 = fieldWeight in 1513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.125 = fieldNorm(doc=1513)
          0.03894039 = weight(abstract_txt:describes in 1513) [ClassicSimilarity], result of:
            0.03894039 = score(doc=1513,freq=1.0), product of:
              0.08146416 = queryWeight, product of:
                1.2838646 = boost
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.016592951 = queryNorm
              0.4780064 = fieldWeight in 1513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.125 = fieldNorm(doc=1513)
          0.22564663 = weight(abstract_txt:speech in 1513) [ClassicSimilarity], result of:
            0.22564663 = score(doc=1513,freq=1.0), product of:
              0.26281628 = queryWeight, product of:
                2.3060148 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.016592951 = queryNorm
              0.8585717 = fieldWeight in 1513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.125 = fieldNorm(doc=1513)
          0.32428098 = weight(abstract_txt:image in 1513) [ClassicSimilarity], result of:
            0.32428098 = score(doc=1513,freq=1.0), product of:
              0.48271167 = queryWeight, product of:
                5.4130306 = boost
                5.374322 = idf(docFreq=556, maxDocs=44218)
                0.016592951 = queryNorm
              0.67179024 = fieldWeight in 1513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.374322 = idf(docFreq=556, maxDocs=44218)
                0.125 = fieldNorm(doc=1513)
        0.24 = coord(6/25)
    
  3. Starostenko, O.; Rodríguez-Asomoza, J.; Sánchez-López, S.E.; Chávez-Aragón, J.A.: Shape indexing and retrieval : a hybrid approach using ontological description (2008) 0.18
    0.17646347 = sum of:
      0.17646347 = product of:
        0.7352645 = sum of:
          0.046002623 = weight(abstract_txt:interpretation in 4318) [ClassicSimilarity], result of:
            0.046002623 = score(doc=4318,freq=1.0), product of:
              0.09884577 = queryWeight, product of:
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.016592951 = queryNorm
              0.46539798 = fieldWeight in 4318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.078125 = fieldNorm(doc=4318)
          0.06548258 = weight(abstract_txt:textual in 4318) [ClassicSimilarity], result of:
            0.06548258 = score(doc=4318,freq=2.0), product of:
              0.09927584 = queryWeight, product of:
                1.0021731 = boost
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.016592951 = queryNorm
              0.65960234 = fieldWeight in 4318, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.078125 = fieldNorm(doc=4318)
          0.063181564 = weight(abstract_txt:combines in 4318) [ClassicSimilarity], result of:
            0.063181564 = score(doc=4318,freq=1.0), product of:
              0.12213214 = queryWeight, product of:
                1.1115677 = boost
                6.6217136 = idf(docFreq=159, maxDocs=44218)
                0.016592951 = queryNorm
              0.51732135 = fieldWeight in 4318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6217136 = idf(docFreq=159, maxDocs=44218)
                0.078125 = fieldNorm(doc=4318)
          0.016691335 = weight(abstract_txt:system in 4318) [ClassicSimilarity], result of:
            0.016691335 = score(doc=4318,freq=1.0), product of:
              0.063353956 = queryWeight, product of:
                1.1321992 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.016592951 = queryNorm
              0.2634616 = fieldWeight in 4318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.078125 = fieldNorm(doc=4318)
          0.047454536 = weight(abstract_txt:retrieval in 4318) [ClassicSimilarity], result of:
            0.047454536 = score(doc=4318,freq=3.0), product of:
              0.1009148 = queryWeight, product of:
                1.7500844 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016592951 = queryNorm
              0.47024357 = fieldWeight in 4318, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=4318)
          0.49645185 = weight(abstract_txt:image in 4318) [ClassicSimilarity], result of:
            0.49645185 = score(doc=4318,freq=6.0), product of:
              0.48271167 = queryWeight, product of:
                5.4130306 = boost
                5.374322 = idf(docFreq=556, maxDocs=44218)
                0.016592951 = queryNorm
              1.0284646 = fieldWeight in 4318, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.374322 = idf(docFreq=556, maxDocs=44218)
                0.078125 = fieldNorm(doc=4318)
        0.24 = coord(6/25)
    
  4. Broadhurst, R.N.: Caere PageKeeper (1993) 0.17
    0.16939346 = sum of:
      0.16939346 = product of:
        1.0587091 = sum of:
          0.07969814 = weight(abstract_txt:input in 6304) [ClassicSimilarity], result of:
            0.07969814 = score(doc=6304,freq=1.0), product of:
              0.10422898 = queryWeight, product of:
                1.0268693 = boost
                6.1171575 = idf(docFreq=264, maxDocs=44218)
                0.016592951 = queryNorm
              0.7646447 = fieldWeight in 6304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1171575 = idf(docFreq=264, maxDocs=44218)
                0.125 = fieldNorm(doc=6304)
          0.03776818 = weight(abstract_txt:system in 6304) [ClassicSimilarity], result of:
            0.03776818 = score(doc=6304,freq=2.0), product of:
              0.063353956 = queryWeight, product of:
                1.1321992 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.016592951 = queryNorm
              0.5961456 = fieldWeight in 6304, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.125 = fieldNorm(doc=6304)
          0.4826402 = weight(abstract_txt:annotation in 6304) [ClassicSimilarity], result of:
            0.4826402 = score(doc=6304,freq=1.0), product of:
              0.54969954 = queryWeight, product of:
                4.7164326 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.016592951 = queryNorm
              0.8780073 = fieldWeight in 6304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.125 = fieldNorm(doc=6304)
          0.45860258 = weight(abstract_txt:image in 6304) [ClassicSimilarity], result of:
            0.45860258 = score(doc=6304,freq=2.0), product of:
              0.48271167 = queryWeight, product of:
                5.4130306 = boost
                5.374322 = idf(docFreq=556, maxDocs=44218)
                0.016592951 = queryNorm
              0.9500549 = fieldWeight in 6304, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.374322 = idf(docFreq=556, maxDocs=44218)
                0.125 = fieldNorm(doc=6304)
        0.16 = coord(4/25)
    
  5. Witbrock, M.J.; Hauptmann, A.G.: Speech recognition for a digital video library (1998) 0.17
    0.16508934 = sum of:
      0.16508934 = product of:
        0.6878723 = sum of:
          0.07984601 = weight(abstract_txt:recognition in 873) [ClassicSimilarity], result of:
            0.07984601 = score(doc=873,freq=4.0), product of:
              0.10435787 = queryWeight, product of:
                1.0275041 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.016592951 = queryNorm
              0.7651173 = fieldWeight in 873, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.0625 = fieldNorm(doc=873)
          0.08977152 = weight(abstract_txt:video in 873) [ClassicSimilarity], result of:
            0.08977152 = score(doc=873,freq=5.0), product of:
              0.10474792 = queryWeight, product of:
                1.0294225 = boost
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.016592951 = queryNorm
              0.8570243 = fieldWeight in 873, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.0625 = fieldNorm(doc=873)
          0.019470194 = weight(abstract_txt:describes in 873) [ClassicSimilarity], result of:
            0.019470194 = score(doc=873,freq=1.0), product of:
              0.08146416 = queryWeight, product of:
                1.2838646 = boost
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.016592951 = queryNorm
              0.2390032 = fieldWeight in 873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.0625 = fieldNorm(doc=873)
          0.043836623 = weight(abstract_txt:retrieval in 873) [ClassicSimilarity], result of:
            0.043836623 = score(doc=873,freq=4.0), product of:
              0.1009148 = queryWeight, product of:
                1.7500844 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016592951 = queryNorm
              0.43439242 = fieldWeight in 873, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=873)
          0.22564663 = weight(abstract_txt:speech in 873) [ClassicSimilarity], result of:
            0.22564663 = score(doc=873,freq=4.0), product of:
              0.26281628 = queryWeight, product of:
                2.3060148 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.016592951 = queryNorm
              0.8585717 = fieldWeight in 873, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.0625 = fieldNorm(doc=873)
          0.22930129 = weight(abstract_txt:image in 873) [ClassicSimilarity], result of:
            0.22930129 = score(doc=873,freq=2.0), product of:
              0.48271167 = queryWeight, product of:
                5.4130306 = boost
                5.374322 = idf(docFreq=556, maxDocs=44218)
                0.016592951 = queryNorm
              0.47502744 = fieldWeight in 873, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.374322 = idf(docFreq=556, maxDocs=44218)
                0.0625 = fieldNorm(doc=873)
        0.24 = coord(6/25)
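
How the similarity scores are composed

The score breakdowns above all follow the same pattern: each matching term contributes queryWeight * fieldWeight, the contributions are summed, and the sum is scaled by the coord factor (matching terms / query terms). The following is a minimal sketch that recomputes the score of the first entry (doc 5046) from the values shown in its breakdown. The function names and the layout of the term list are illustrative assumptions, not part of the catalog software. The idf formula, 1 + ln(maxDocs / (docFreq + 1)), is the one that reproduces the listed idf values; queryNorm and fieldNorm are copied from the listing rather than derived.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency; this form reproduces the idf values in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, field_norm, query_norm, boost=1.0):
    # tf is the square root of the term frequency, as in the listing (tf(freq=2.0) = 1.4142135).
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm  # "queryWeight" lines
    field_weight = tf * term_idf * field_norm     # "fieldWeight" lines
    return query_weight * field_weight


# Values copied from the breakdown for doc 5046.
MAX_DOCS = 44218
QUERY_NORM = 0.016592951
FIELD_NORM = 0.0546875

# (term, termFreq, docFreq, boost); boost is 1.0 where the listing shows none.
terms = [
    ("interpretation", 2.0, 310, 1.0),
    ("involving", 1.0, 249, 1.0366508),
    ("annotation", 12.0, 106, 4.7164326),
    ("image", 9.0, 556, 5.4130306),
]

total = sum(
    term_score(freq, doc_freq, MAX_DOCS, FIELD_NORM, QUERY_NORM, boost)
    for _, freq, doc_freq, boost in terms
)
coord = 4 / 25  # 4 of the 25 query terms matched
score = coord * total

print(f"sum of term scores: {total:.6f}")
print(f"final score:        {score:.6f}")
```

The result agrees with the listed 0.19815929 up to small rounding differences, since the listing was computed in single precision and this sketch uses double precision. The same function applies to the other entries once their termFreq, docFreq, boost, fieldNorm, and coord values are substituted.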