Search (4 results, page 1 of 1)

  • × theme_ss:"Sprachretrieval"
  1. Srihari, R.K.: Using speech input for image interpretation, annotation, and retrieval (1997) 0.04
    0.035128992 = product of:
      0.10538697 = sum of:
        0.10538697 = sum of:
          0.06711477 = weight(_text_:multimedia in 764) [ClassicSimilarity], result of:
            0.06711477 = score(doc=764,freq=2.0), product of:
              0.21832302 = queryWeight, product of:
                4.6372695 = idf(docFreq=1163, maxDocs=44218)
                0.04708008 = queryNorm
              0.30741042 = fieldWeight in 764, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6372695 = idf(docFreq=1163, maxDocs=44218)
                0.046875 = fieldNorm(doc=764)
          0.038272206 = weight(_text_:22 in 764) [ClassicSimilarity], result of:
            0.038272206 = score(doc=764,freq=2.0), product of:
              0.16486642 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.04708008 = queryNorm
              0.23214069 = fieldWeight in 764, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=764)
      0.33333334 = coord(1/3)
    
    Abstract
    Explores the interaction of textual and photographic information in an integrated text and image database environment and describes 3 different applications involving the exploitation of linguistic context in vision. Describes the practical application of these ideas in working systems. PICTION uses captions to identify human faces in a photograph, wile Show&Tell is a multimedia system for semi automatic image annotation. The system combines advances in speech recognition, natural language processing and image understanding to assist in image annotation and enhance image retrieval capabilities. Presents an extension of this work to video annotation and retrieval
    Date
    22. 9.1997 19:16:05
  2. Wittbrock, M.J.; Hauptmann, A.G.: Speech recognition for a digital video library (1998) 0.01
    0.013182588 = product of:
      0.039547764 = sum of:
        0.039547764 = product of:
          0.07909553 = sum of:
            0.07909553 = weight(_text_:multimedia in 873) [ClassicSimilarity], result of:
              0.07909553 = score(doc=873,freq=4.0), product of:
                0.21832302 = queryWeight, product of:
                  4.6372695 = idf(docFreq=1163, maxDocs=44218)
                  0.04708008 = queryNorm
                0.3622867 = fieldWeight in 873, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.6372695 = idf(docFreq=1163, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=873)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    The standard method for making the full content of audio and video material searchable is to annotate it with human-generated meta-data that describes the content in a way that search can understand, as is done in the creation of multimedia CD-ROMs. However, for the huge amounts of data that could usefully be included in digital video and audio libraries, the cost of producing the meta-data is prohibitive. In the Informedia Digital Video Library, the production of the meta-data supporting the library interface is automated using techniques derived from artificial intelligence (AI) research. By applying speech recognition together with natural language processing, information retrieval, and image analysis, an interface has been prduced that helps users locate the information they want, and navigate or browse the digital video library more effectively. Specific interface components include automatc titles, filmstrips, video skims, word location marking, and representative frames for shots. Both the user interface and the information retrieval engine within Informedia are designed for use with automatically derived meta-data, much of which depends on speech recognition for its production. Some experimental information retrieval results will be given, supporting a basic premise of the Informedia project: That speech recognition generated transcripts can make multimedia material searchable. The Informedia project emphasizes the integration of speech recognition, image processing, natural language processing, and information retrieval to compensate for deficiencies in these individual technologies
  3. Sparck Jones, K.; Jones, G.J.F.; Foote, J.T.; Young, S.J.: Experiments in spoken document retrieval (1996) 0.01
    0.013050094 = product of:
      0.039150283 = sum of:
        0.039150283 = product of:
          0.078300565 = sum of:
            0.078300565 = weight(_text_:multimedia in 1951) [ClassicSimilarity], result of:
              0.078300565 = score(doc=1951,freq=2.0), product of:
                0.21832302 = queryWeight, product of:
                  4.6372695 = idf(docFreq=1163, maxDocs=44218)
                  0.04708008 = queryNorm
                0.3586455 = fieldWeight in 1951, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.6372695 = idf(docFreq=1163, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1951)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Describes experiments in the retrieval of spoken documents in multimedia systems. Speech documents pose a particular problem for retrieval since their words as well as contents are unknown. Addresses this problem, for a video mail application, by combining state of the art speech recognition with established document retrieval technologies so as to provide an effective and efficient retrieval tool. Tests with a small spoken message collection show that retrieval precision for the spoken file can reach 90% of that obtained when the same file is used, as a benchmark, in text transcription form
  4. Rösener, C.: ¬Die Stecknadel im Heuhaufen : Natürlichsprachlicher Zugang zu Volltextdatenbanken (2005) 0.01
    0.01054607 = product of:
      0.03163821 = sum of:
        0.03163821 = product of:
          0.06327642 = sum of:
            0.06327642 = weight(_text_:multimedia in 548) [ClassicSimilarity], result of:
              0.06327642 = score(doc=548,freq=4.0), product of:
                0.21832302 = queryWeight, product of:
                  4.6372695 = idf(docFreq=1163, maxDocs=44218)
                  0.04708008 = queryNorm
                0.28982934 = fieldWeight in 548, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.6372695 = idf(docFreq=1163, maxDocs=44218)
                  0.03125 = fieldNorm(doc=548)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    RSWK
    Brockhaus-Enzyklopädie / Multimedia / Recherche
    Subject
    Brockhaus-Enzyklopädie / Multimedia / Recherche