Search (4 results, page 1 of 1)

Huang, H.; Jörgensen, C.: Characterizing user tagging and Co-occurring metadata in general and specialized metadata collections (2013) 0.00
```
0.0020714647 = product of:
  0.0041429293 = sum of:
    0.0041429293 = product of:
      0.008285859 = sum of:
        0.008285859 = weight(_text_:a in 1046) [ClassicSimilarity], result of:
          0.008285859 = score(doc=1046,freq=12.0), product of:
            0.053105544 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046056706 = queryNorm
            0.15602624 = fieldWeight in 1046, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1046)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

This study aims to identify the categorical characteristics and usage patterns of the most popular image tags in Flickr. The "metadata usage ratio" is introduced as a means of assessing the usage of a popular tag as metadata. We also compare how popular tags are used as image tags or metadata in the Flickr general collection and the Library of Congress's photostream (LCP), also in Flickr. The Flickr popular tags in the list overall are categorically stable, and the changes that do appear reflect Flickr users' evolving technology-driven cultural experience. The popular tags in Flickr had a high number of generic objects and specific locations-related tags and were rarely at the abstract level. Conversely, the popular tags in the LCP describe more in the specific objects and time categories. Flickr users copied the Library of Congress-supplied metadata that related to specific objects or events and standard bibliographic information (e.g., author, format, time references) as popular tags in the LCP. Those popular tags related to generic objects and events showed a high metadata usage ratio, while those related to specific locations and objects showed a low image metadata usage ratio. Popular tags in Flickr appeared less frequently as image metadata when describing specific objects than specific times and locations for historical images in Flickr LCP collections. Understanding how people contribute image tags or image metadata in Flickr helps determine what users need to describe and query images, and could help improve image browsing and retrieval.

Type

a
Jörgensen, C.; Stvilia, B.; Wu, S.: Assessing the relationships among tag syntax, semantics, and perceived usefulness (2014) 0.00
```
0.001757696 = product of:
  0.003515392 = sum of:
    0.003515392 = product of:
      0.007030784 = sum of:
        0.007030784 = weight(_text_:a in 1244) [ClassicSimilarity], result of:
          0.007030784 = score(doc=1244,freq=6.0), product of:
            0.053105544 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046056706 = queryNorm
            0.13239266 = fieldWeight in 1244, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=1244)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

With the recent interest in socially created metadata as a potentially complementary resource for image description in relation to established tools such as thesauri and other forms of controlled vocabulary, questions remain about the quality and reuse value of these metadata. This study describes and examines a set of tags using quantitative and qualitative methods and assesses relationships among categories of image tags, tag assignment order, and users' perceptions of usefulness of index terms and user-contributed tags. The study found that tags provide much descriptive information about an image but that users also value and trust controlled vocabulary terms. The study found no correlation between tag length and assignment order, and tag length and its perceived usefulness. The findings of this study can contribute to the design of controlled vocabularies, indexing processes, and retrieval systems for images. In particular, the findings of the study can advance the understanding of image tagging practices, tag facet/category distributions, relative usefulness and importance of these categories to the user, and potential mechanisms for identifying useful terms.

Type

a
Stvilia, B.; Jörgensen, C.: Member activities and quality of tags in a collection of historical photographs in Flickr (2010) 0.00
```
0.0016913437 = product of:
  0.0033826875 = sum of:
    0.0033826875 = product of:
      0.006765375 = sum of:
        0.006765375 = weight(_text_:a in 4117) [ClassicSimilarity], result of:
          0.006765375 = score(doc=4117,freq=8.0), product of:
            0.053105544 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046056706 = queryNorm
            0.12739488 = fieldWeight in 4117, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4117)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

To enable and guide effective metadata creation it is essential to understand the structure and patterns of the activities of the community around the photographs, resources used, and scale and quality of the socially created metadata relative to the metadata and knowledge already encoded in existing knowledge organization systems. This article presents an analysis of Flickr member discussions around the photographs of the Library of Congress photostream in Flickr. The article also reports on an analysis of the intrinsic and relational quality of the photostream tags relative to two knowledge organization systems: the Thesaurus for Graphic Materials (TGM) and the Library of Congress Subject Headings (LCSH). Thirty seven percent of the original tag set and 15.3% of the preprocessed set (after the removal of tags with fewer than three characters and URLs) were invalid or misspelled terms. Nouns, named entity terms, and complex terms constituted approximately 77% of the preprocessed set. More than a half of the photostream tags were not found in the TGM and LCSH, and more than a quarter of those terms were regular nouns and noun phrases. This suggests that these terms could be complimentary to more traditional methods of indexing using controlled vocabularies.

Type

a
Huang, H.; Stvilia, B.; Jörgensen, C.; Bass, H.W.: Prioritization of data quality dimensions and skills requirements in genome annotation work (2012) 0.00
```
0.0014647468 = product of:
  0.0029294936 = sum of:
    0.0029294936 = product of:
      0.005858987 = sum of:
        0.005858987 = weight(_text_:a in 4971) [ClassicSimilarity], result of:
          0.005858987 = score(doc=4971,freq=6.0), product of:
            0.053105544 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046056706 = queryNorm
            0.11032722 = fieldWeight in 4971, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4971)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

The rapid accumulation of genome annotations, as well as their widespread reuse in clinical and scientific practice, poses new challenges to management of the quality of scientific data. This study contributes towards better understanding of scientists' perceptions of and priorities for data quality and data quality assurance skills needed in genome annotation. This study was guided by a previously developed general framework for assessment of data quality and by a taxonomy of data-quality (DQ) skills, and intended to define context-sensitive models of criteria for data quality and skills for genome annotation. Analysis of the results revealed that genomics scientists recognize specific sets of criteria for quality in the genome-annotation context. Seventeen data quality dimensions were reduced to 5-factor constructs, and 17 relevant skills were grouped into 4-factor constructs. The constructs defined by this study advance the understanding of data quality relationships and are an important contribution to data and information quality research. In addition, the resulting models can serve as valuable resources to genome data curators and administrators for developing data-curation policies and designing DQ-assurance strategies, processes, procedures, and infrastructure. The study's findings may also inform educators in developing data quality assurance curricula and training courses.

Type

a

Search (4 results, page 1 of 1)

Authors

Themes