Search (9 results, page 1 of 1)

  • × theme_ss:"Sprachretrieval"
  • × type_ss:"a"
  • × year_i:[1990 TO 2000}
  1. Srihari, R.K.: Using speech input for image interpretation, annotation, and retrieval (1997) 0.05
    0.051782876 = product of:
      0.10356575 = sum of:
        0.06671549 = weight(_text_:processing in 764) [ClassicSimilarity], result of:
          0.06671549 = score(doc=764,freq=4.0), product of:
            0.175792 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.043425296 = queryNorm
            0.3795138 = fieldWeight in 764, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.046875 = fieldNorm(doc=764)
        0.036850262 = product of:
          0.05527539 = sum of:
            0.019974224 = weight(_text_:science in 764) [ClassicSimilarity], result of:
              0.019974224 = score(doc=764,freq=2.0), product of:
                0.11438741 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.043425296 = queryNorm
                0.17461908 = fieldWeight in 764, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.046875 = fieldNorm(doc=764)
            0.035301168 = weight(_text_:22 in 764) [ClassicSimilarity], result of:
              0.035301168 = score(doc=764,freq=2.0), product of:
                0.15206799 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.043425296 = queryNorm
                0.23214069 = fieldWeight in 764, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=764)
          0.6666667 = coord(2/3)
      0.5 = coord(2/4)
    
    Abstract
    Explores the interaction of textual and photographic information in an integrated text and image database environment and describes 3 different applications involving the exploitation of linguistic context in vision. Describes the practical application of these ideas in working systems. PICTION uses captions to identify human faces in a photograph, wile Show&Tell is a multimedia system for semi automatic image annotation. The system combines advances in speech recognition, natural language processing and image understanding to assist in image annotation and enhance image retrieval capabilities. Presents an extension of this work to video annotation and retrieval
    Date
    22. 9.1997 19:16:05
    Imprint
    Urbana-Champaign, IL : Illinois University at Urbana-Champaign, Department of Library and Information Science
    Source
    Digital image access and retrieval: Proceedings of the 1996 Clinic on Library Applications of Data Processing, 24-26 Mar 1996. Ed.: P.B. Heidorn u. B. Sandore
  2. Wittbrock, M.J.; Hauptmann, A.G.: Speech recognition for a digital video library (1998) 0.04
    0.0368198 = product of:
      0.0736396 = sum of:
        0.068091206 = weight(_text_:processing in 873) [ClassicSimilarity], result of:
          0.068091206 = score(doc=873,freq=6.0), product of:
            0.175792 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.043425296 = queryNorm
            0.38733965 = fieldWeight in 873, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.0390625 = fieldNorm(doc=873)
        0.005548396 = product of:
          0.016645188 = sum of:
            0.016645188 = weight(_text_:science in 873) [ClassicSimilarity], result of:
              0.016645188 = score(doc=873,freq=2.0), product of:
                0.11438741 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.043425296 = queryNorm
                0.1455159 = fieldWeight in 873, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=873)
          0.33333334 = coord(1/3)
      0.5 = coord(2/4)
    
    Abstract
    The standard method for making the full content of audio and video material searchable is to annotate it with human-generated meta-data that describes the content in a way that search can understand, as is done in the creation of multimedia CD-ROMs. However, for the huge amounts of data that could usefully be included in digital video and audio libraries, the cost of producing the meta-data is prohibitive. In the Informedia Digital Video Library, the production of the meta-data supporting the library interface is automated using techniques derived from artificial intelligence (AI) research. By applying speech recognition together with natural language processing, information retrieval, and image analysis, an interface has been prduced that helps users locate the information they want, and navigate or browse the digital video library more effectively. Specific interface components include automatc titles, filmstrips, video skims, word location marking, and representative frames for shots. Both the user interface and the information retrieval engine within Informedia are designed for use with automatically derived meta-data, much of which depends on speech recognition for its production. Some experimental information retrieval results will be given, supporting a basic premise of the Informedia project: That speech recognition generated transcripts can make multimedia material searchable. The Informedia project emphasizes the integration of speech recognition, image processing, natural language processing, and information retrieval to compensate for deficiencies in these individual technologies
    Source
    Journal of the American Society for Information Science. 49(1998) no.7, S.619-632
  3. Keller, F.: How do humans deal with ungrammatical input? : Experimental evidence and computational modelling (1996) 0.02
    0.023587486 = product of:
      0.09434994 = sum of:
        0.09434994 = weight(_text_:processing in 7293) [ClassicSimilarity], result of:
          0.09434994 = score(doc=7293,freq=2.0), product of:
            0.175792 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.043425296 = queryNorm
            0.53671354 = fieldWeight in 7293, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.09375 = fieldNorm(doc=7293)
      0.25 = coord(1/4)
    
    Source
    Natural language processing and speech technology: Results of the 3rd KONVENS Conference, Bielefeld, October 1996. Ed.: D. Gibbon
  4. Marx, J.: ¬Die '¬Computer-Talk-These' in der Sprachgenerierung : Hinweise zur Gestaltung natürlichsprachlicher Zustandsanzeigen in multimodalen Informationssystemen (1996) 0.02
    0.023587486 = product of:
      0.09434994 = sum of:
        0.09434994 = weight(_text_:processing in 7294) [ClassicSimilarity], result of:
          0.09434994 = score(doc=7294,freq=2.0), product of:
            0.175792 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.043425296 = queryNorm
            0.53671354 = fieldWeight in 7294, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.09375 = fieldNorm(doc=7294)
      0.25 = coord(1/4)
    
    Source
    Natural language processing and speech technology: Results of the 3rd KONVENS Conference, Bielefeld, October 1996. Ed.: D. Gibbon
  5. Schultz, T.; Soltau, H.: Automatische Identifizierung spontan gesprochener Sprachen mit neuronalen Netzen (1996) 0.02
    0.023587486 = product of:
      0.09434994 = sum of:
        0.09434994 = weight(_text_:processing in 7295) [ClassicSimilarity], result of:
          0.09434994 = score(doc=7295,freq=2.0), product of:
            0.175792 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.043425296 = queryNorm
            0.53671354 = fieldWeight in 7295, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.09375 = fieldNorm(doc=7295)
      0.25 = coord(1/4)
    
    Source
    Natural language processing and speech technology: Results of the 3rd KONVENS Conference, Bielefeld, October 1996. Ed.: D. Gibbon
  6. Thompson, L.A.; Ogden, W.C.: Visible speech improves human language understanding : implications for speech processing systems (1995) 0.02
    0.022238497 = product of:
      0.08895399 = sum of:
        0.08895399 = weight(_text_:processing in 3883) [ClassicSimilarity], result of:
          0.08895399 = score(doc=3883,freq=4.0), product of:
            0.175792 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.043425296 = queryNorm
            0.5060184 = fieldWeight in 3883, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.0625 = fieldNorm(doc=3883)
      0.25 = coord(1/4)
    
    Abstract
    Presents evidence from the study of human language understanding suggesting that the ability to perceive visible speech can greatly influence the ability to understand and remember spoken language. A view of the speaker's face can greatly aid in the perception of ambiguous or noisy speech and can aid cognitive processing of speech leading to better understanding and recall. Some of these effects have been replaced using computer synthesized visual and auditory speech. When giving an interface a voice, it may be best to give it a face too
  7. Sparck Jones, K.; Jones, G.J.F.; Foote, J.T.; Young, S.J.: Experiments in spoken document retrieval (1996) 0.01
    0.013759367 = product of:
      0.05503747 = sum of:
        0.05503747 = weight(_text_:processing in 1951) [ClassicSimilarity], result of:
          0.05503747 = score(doc=1951,freq=2.0), product of:
            0.175792 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.043425296 = queryNorm
            0.3130829 = fieldWeight in 1951, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1951)
      0.25 = coord(1/4)
    
    Source
    Information processing and management. 32(1996) no.4, S.399-417
  8. Young, C.W.; Eastman, C.M.; Oakman, R.L.: ¬An analysis of ill-formed input in natural language queries to document retrieval systems (1991) 0.01
    0.011793743 = product of:
      0.04717497 = sum of:
        0.04717497 = weight(_text_:processing in 5263) [ClassicSimilarity], result of:
          0.04717497 = score(doc=5263,freq=2.0), product of:
            0.175792 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.043425296 = queryNorm
            0.26835677 = fieldWeight in 5263, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.046875 = fieldNorm(doc=5263)
      0.25 = coord(1/4)
    
    Source
    Information processing and management. 27(1991) no.6, S.615-622
  9. Lange, H.R.: Speech synthesis and speech recognition : tomorrow's human-computer interface? (1993) 0.00
    0.0022193585 = product of:
      0.008877434 = sum of:
        0.008877434 = product of:
          0.0266323 = sum of:
            0.0266323 = weight(_text_:science in 7224) [ClassicSimilarity], result of:
              0.0266323 = score(doc=7224,freq=2.0), product of:
                0.11438741 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.043425296 = queryNorm
                0.23282544 = fieldWeight in 7224, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0625 = fieldNorm(doc=7224)
          0.33333334 = coord(1/3)
      0.25 = coord(1/4)
    
    Source
    Annual review of information science and technology. 28(1993), S.153-185