Search (2 results, page 1 of 1)

Benitez, A.B.; Zhong, D.; Chang, S.-F.: Enabling MPEG-7 structural and semantic descriptions in retrieval applications (2007) 0.02
```
0.020225393 = product of:
  0.040450785 = sum of:
    0.040450785 = product of:
      0.08090157 = sum of:
        0.08090157 = weight(_text_:e.g in 518) [ClassicSimilarity], result of:
          0.08090157 = score(doc=518,freq=2.0), product of:
            0.23393378 = queryWeight, product of:
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.044842023 = queryNorm
            0.34583107 = fieldWeight in 518, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.046875 = fieldNorm(doc=518)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

The MPEG-7 standard supports the description of both the structure and the semantics of multimedia; however, the generation and consumption of MPEG-7 structural and semantic descriptions are outside the scope of the standard. This article presents two research prototype systems that demonstrate the generation and consumption of MPEG-7 structural and semantic descriptions in retrieval applications. The active system for MPEG-4 video object simulation (AMOS) is a video object segmentation and retrieval system that segments, tracks, and models objects in videos (e.g., person, car) as a set of regions with corresponding visual features and spatiotemporal relations. The region-based model provides an effective base for similarity retrieval of video objects. The second system, the Intelligent Multimedia Knowledge Application (IMKA), uses the novel MediaNet framework for representing semantic and perceptual information about the world using multimedia. MediaNet knowledge bases can be constructed automatically from annotated collections of multimedia data and used to enhance the retrieval of multimedia.
Jörgensen, C.; Jaimes, A.; Benitez, A.B.; Chang, S.-F.: ¬A conceptual framework and empirical research for classifying visual descriptors (2001) 0.02
```
0.016854495 = product of:
  0.03370899 = sum of:
    0.03370899 = product of:
      0.06741798 = sum of:
        0.06741798 = weight(_text_:e.g in 6532) [ClassicSimilarity], result of:
          0.06741798 = score(doc=6532,freq=2.0), product of:
            0.23393378 = queryWeight, product of:
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.044842023 = queryNorm
            0.28819257 = fieldWeight in 6532, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.0390625 = fieldNorm(doc=6532)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

This article presents exploratory research evaluating a conceptual structure for the description of visual content of images. The structure, which was developed from empirical research in several fields (e.g., Computer Science, Psychology, Information Studies, etc.), classifies visual attributes into a "Pyramid" containing four syntactic levels (type/technique, global distribution, local structure, composition), and six semantic levels (generic, specific, and abstract levels of both object and scene, respectively). Various experiments are presented, which address the Pyramid's ability to achieve several tasks: (1) classification of terms describing image attributes generated in a formal and an informal description task, (2) classification of terms that result from a structured approach to indexing, and (3) guidance in the indexing process. Several descriptions, generated by naive users and indexers, are used in experiments that include two image collections: a random Web sample, and a set of news images. To test descriptions generated in a structured setting, an Image Indexing Template (developed independently over several years of this project by one of the authors) was also used. The experiments performed suggest that the Pyramid is conceptually robust (i.e., can accommodate a full range of attributes), and that it can be used to organize visual content for retrieval, to guide the indexing process, and to classify descriptions obtained manually and automatically

Search (2 results, page 1 of 1)

Authors