Diese Datenbank enthält über 40.000 Dokumente zu Themen aus den Bereichen Formalerschließung – Inhaltserschließung – Information Retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft / Powered by litecat, BIS Oldenburg (Stand: 18. September 2018)
1Lim, S. ; Woo, J.R. ; Lee, J. ; Huh, S.-Y.: Consumer valuation of personal information in the age of big data.
In: Journal of the Association for Information Science and Technology. 69(2018) no.1, S.60-71.
Abstract: In a big data environment, there are growing concerns about the violation of consumer rights regarding information privacy. To induce rational regulations for protecting personal information, it is necessary to separately estimate consumers' values related to different types of personal information. In this article, discrete choice experiments using hypothetical information leakage situations given certain compensation amounts and discrete choice models were used to quantitatively analyze the value of personal information. The results indicate that consumers generally place high value on information that could cause immediate and actual damage from the leakage after identification, such as basic personal information and purchase list and payment information. Consumers value location information and personal medical information differently based on their perceived importance of privacy and their prior experience with personal information leakage. We suggest that the level of regulation should differ according to the type of personal information based on the consumers' valuation. This article contributes to a better understanding of a quantitative approach to pricing personal information.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23915/full.
2Cho, H. ; Donovan, A. ; Lee, J.H.: Art in an algorithm : a taxonomy for describing video game visual styles.
In: Journal of the Association for Information Science and Technology. 69(2018) no.5, S.633-646.
Abstract: The discovery and retrieval of video games in library and information systems is, by and large, dependent on a limited set of descriptive metadata. Noticeably missing from this metadata are classifications of visual style-despite the overwhelmingly visual nature of most video games and the interest in visual style among video game users. One explanation for this paucity is the difficulty in eliciting consistent judgements about visual style, likely due to subjective interpretations of terminology and a lack of demonstrable testing for coinciding judgements. This study presents a taxonomy of video game visual styles constructed from the findings of a 22-participant cataloging user study of visual styles. A detailed description of the study, and its value and shortcomings, are presented along with reflections about the challenges of cultivating consensus about visual style in video games. The high degree of overall agreement in the user study demonstrates the potential value of a descriptor like visual style and the use of a cataloging study in developing visual style taxonomies. The resulting visual style taxonomy, the methods and analysis described herein may help improve the organization and retrieval of video games and possibly other visual materials like graphic designs, illustrations, and animations.
Inhalt: Vgl.: https://onlinelibrary.wiley.com/doi/abs/10.1002/asi.23988.
Behandelte Form: Spiele
3Hu, X. ; Lee, J.H. ; Bainbridge, D. ; Choi, K. ; Organisciak, P. ; Downie, J.S.: ¬The MIREX grand challenge : a framework of holistic user-experience evaluation in music information retrieval.
In: Journal of the Association for Information Science and Technology. 68(2017) no.1, S.97-112.
Abstract: Music Information Retrieval (MIR) evaluation has traditionally focused on system-centered approaches where components of MIR systems are evaluated against predefined data sets and golden answers (i.e., ground truth). There are two major limitations of such system-centered evaluation approaches: (a) The evaluation focuses on subtasks in music information retrieval, but not on entire systems and (b) users and their interactions with MIR systems are largely excluded. This article describes the first implementation of a holistic user-experience evaluation in MIR, the MIREX Grand Challenge, where complete MIR systems are evaluated, with user experience being the single overarching goal. It is the first time that complete MIR systems have been evaluated with end users in a realistic scenario. We present the design of the evaluation task, the evaluation criteria and a novel evaluation interface, and the data-collection platform. This is followed by an analysis of the results, reflection on the experience and lessons learned, and plans for future directions.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23618/full.
4Jett, J. ; Humpal, N. ; Charles, V. ; Lee, J.-H.: What is a series, really?.
In: Knowledge organization. 44(2017) no.1, S.24-36.
Abstract: As library user needs become increasingly nuanced and technical, the lack of adequate metadata to meet user needs is creating a broadening gulf between library catalog functionality and library user expectations. One of the areas where the dearth of metadata is forming barriers is the idea of "series." While traditional bibliographic definitions of "series" have been adequate to meet user needs in the past, their inability to fully encompass more complex media types beyond simple text is forming barriers against the accessibility of non-traditional formats such as video games, artistic works, datasets, and similar information resources. This article explores the concept of "series" both as it is employed in bibliographic cataloging settings and encompasses actual works. The authors review the term's usage and general meaning across a large variety of media types beyond traditional journals and monographs. Examples are developed as counter-examples to the adequacy of the traditional bibliographic view of series. The authors conclude that the library and information science community as a whole needs to engage in a broader discussion of series cataloging practices and suggest alternate accounts of series that view them as aggregations (like collections) or as containers for intellectual content.
5Lee, J.H. ; Wishkoski, R. ; Aase, L. ; Meas, P. ; Hubbles, C.: Understanding users of cloud music services : selection factors, management and access behavior, and perceptions.
In: Journal of the Association for Information Science and Technology. 68(2017) no.5, S.1186-1200.
Abstract: Recent, rapid changes in technology have resulted in a proliferation of choices for music storage and access. Portable, web-enabled music devices are widespread, and listeners now enjoy a plethora of options regarding formats, devices, and access methods. Yet in this mobile music environment, listeners' access and management strategies for music collections are poorly understood, because behaviors surrounding the organization and retrieval of music collections have received little formal study. Our current research seeks to enrich our knowledge of people's music listening and collecting behavior through a series of systematic user studies. In this paper we present our findings from interviews involving 20 adult and 20 teen users of commercial cloud music services. Our results contribute to theoretical understandings of users' music information behavior in a time of upheaval in music usage patterns, and more generally, the purposes and meanings users ascribe to personal media collections in cloud-based systems. The findings suggest improvements to the future design of cloud-based music services, as well as to any information systems and services designed for personal media collections, benefiting both commercial entities and listeners.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23754/full.
Themenfeld: Information Gateway
6Branch, F. ; Arias, T. ; Kennah, J. ; Phillips, R. ; Windleharth, T. ; Lee, J.H.: Representing transmedia fictional worlds through ontology.
In: Journal of the Association for Information Science and Technology. 68(2017) no.12, S.2771-2782.
Abstract: Currently, there is no structured data standard for representing elements commonly found in transmedia fictional worlds. Although there are websites dedicated to individual universes, the information found on these sites separate out the various formats, concentrate on only the bibliographic aspects of the material, and are only searchable with full text. We have created an ontological model that will allow various user groups interested in transmedia to search for and retrieve the information contained in these worlds based upon their structure. We conducted a domain analysis and user studies based on the contents of Harry Potter, Lord of the Rings, the Marvel Universe, and Star Wars in order to build a new model using Ontology Web Language (OWL) and an artificial intelligence-reasoning engine. This model can infer connections between transmedia properties such as characters, elements of power, items, places, events, and so on. This model will facilitate better search and retrieval of the information contained within these vast story universes for all users interested in them. The result of this project is an OWL ontology reflecting real user needs based upon user research, which is intuitive for users and can be used by artificial intelligence systems.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23886/full.
Themenfeld: Multimedia ; Wissensrepräsentation
7Jett, J. ; Sacchi, S. ; Lee, J.H. ; Clarke, R.I.: ¬A conceptual model for video games and interactive media.
In: Journal of the Association for Information Science and Technology. 67(2016) no.3, S.505-517.
Abstract: In this article, we describe a conceptual model for video games and interactive media. Existing conceptual models such as the Functional Requirements for Bibliographic Records (FRBR) are not adequate to represent the unique descriptive attributes, levels of variance, and relationships among video games. Previous video game-specific models tend to focus on the development of video games and their technical aspects. Our model instead attempts to reflect how users such as game players, collectors, and scholars understand video games and the relationships among them. We specifically consider use cases of gamers, with future intentions of using this conceptual model as a foundation for developing a union catalog for various libraries and museums. In the process of developing the model, we encountered many challenges, including conceptual overlap with and divergence from FRBR, entity scoping, complex relationships among entities, and the question of how to model additional content for game expansion. Future work will focus on making this model interoperable with existing ontologies as well as further understanding and description of content and relationships.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23409/abstract.
Behandelte Form: Spiele ; Videos
8Lee, J.H. ; Price, R.: User experience with commercial music services : an empirical exploration.
In: Journal of the Association for Information Science and Technology. 67(2016) no.4, S.800-811.
Abstract: The music information retrieval (MIR) community has long understood the role of evaluation as a critical component for successful information retrieval systems. Over the past several years, it has also become evident that user-centered evaluation based on realistic tasks is essential for creating systems that are commercially marketable. Although user-oriented research has been increasing, the MIR field is still lacking in holistic, user-centered approaches to evaluating music services beyond measuring the performance of search or classification algorithms. In light of this need, we conducted a user study exploring how users evaluate their overall experience with existing popular commercial music services, asking about their interactions with the system as well as situational and personal characteristics. In this paper, we present a qualitative heuristic evaluation of commercial music services based on Jakob Nielsen's 10 usability heuristics for user interface design, and also discuss 8 additional criteria that may be used for the holistic evaluation of user experience in MIR systems. Finally, we recommend areas of future user research raised by trends and patterns that surfaced from this user study.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23433/abstract.
9Lee, J.H. ; Cho, H. ; Kim, Y.-S.: Users' music information needs and behaviors : design implications for music information retrieval systems.
In: Journal of the Association for Information Science and Technology. 67(2016) no.6, S.1301-1330.
Abstract: User studies in the music information retrieval (MIR) domain tend to be exploratory and qualitative in nature, involving a small number of users, which makes it difficult to derive broader implications for system design. In order to fill this gap, we conducted a large-scale user survey questioning various aspects of people's music information needs and behaviors. In particular, we investigated if general music users' needs and behaviors have significantly changed over time by comparing our current survey results with a similar survey conducted in 2004. In this paper, we present the key findings from the survey data and discuss 4 emergent themes-(a) the shift in access and use of personal music collections; (b) the growing need for tools to support collaborative music seeking, listening, and sharing; (c) the importance of "visual" music experiences; and (d) the need for ontologies for providing rich contextual information. We conclude by making specific recommendations for improving the design of MIR systems and services.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23471/abstract.
10Wang, Y. ; Lee, J.-S. ; Choi, I.-C.: Indexing by Latent Dirichlet Allocation and an Ensemble Model.
In: Journal of the Association for Information Science and Technology. 67(2016) no.7, S.1736-1750.
Abstract: The contribution of this article is twofold. First, we present Indexing by latent Dirichlet allocation (LDI), an automatic document indexing method. Many ad hoc applications, or their variants with smoothing techniques suggested in LDA-based language modeling, can result in unsatisfactory performance as the document representations do not accurately reflect concept space. To improve document retrieval performance, we introduce a new definition of document probability vectors in the context of LDA and present a novel scheme for automatic document indexing based on LDA. Second, we propose an Ensemble Model (EnM) for document retrieval. EnM combines basic indexing models by assigning different weights and attempts to uncover the optimal weights to maximize the mean average precision. To solve the optimization problem, we propose an algorithm, which is derived based on the boosting method. The results of our computational experiments on benchmark data sets indicate that both the proposed approaches are viable options for document retrieval.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23444/abstract.
11Chung, E.K. ; Kwon, N. ; Lee, J.: Understanding scientific collaboration in the research life cycle : bio- and nanoscientists' motivations, information-sharing and communication practices, and barriers to collaboration.
In: Journal of the Association for Information Science and Technology. 67(2016) no.8, S.1836-1848.
Abstract: This study aims to identify the way researchers collaborate with other researchers in the course of the scientific research life cycle and provide information to the designers of e-Science and e-Research implementations. On the basis of in-depth interviews with and on-site observations of 24 scientists and a follow-up focus group interview in the field of bioscience/nanoscience and technology in Korea, we examined scientific collaboration using the framework of the scientific research life cycle. We attempt to explain the major motivations, characteristics of communication and information sharing, and barriers associated with scientists' research collaboration practices throughout the research life cycle. The findings identify several notable phenomena including motivating factors, the timing of collaboration formation, partner selection, communication methods, information-sharing practices, and barriers at each phase of the life cycle. We find that specific motivations were related to specific phases. The formation of collaboration was observed throughout the entire process, not only in the beginning phase of the cycle. For communication and information-sharing practices, scientists continue to favor traditional means of communication for security reasons. Barriers to collaboration throughout the phases included different priorities, competitive tensions, and a hierarchical culture among collaborators, whereas credit sharing was a barrier in the research product phase.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23520/abstract.
12Welhouse, Z. ; Lee, J.H. ; Bancroft, J.: "What am I fighting for?" : creating a controlled vocabulary for video game plot metadata.
In: Cataloging and classification quarterly. 53(2015) no.2, S.157-189.
Abstract: A video game's plot is one of its defining features, and prior research confirms the importance of plot metadata to users through persona analysis, interviews, and surveys. However, existing organizational systems, including library catalogs, game-related websites, and traditional plot classification systems, do not adequately describe the plot information of video games, in other words, what the game is really about. We attempt to address the issue by creating a controlled vocabulary based on a domain analysis involving a review of relevant literature and existing data structures. The controlled vocabulary is constructed in a pair structure for maximizing flexibility and extensibility. Adopting this controlled vocabulary for describing plot information of games will allow for useful search and collocation of video games.
Themenfeld: Metadaten ; Konzeption und Anwendung des Prinzips Thesaurus
Behandelte Form: Videos ; Spiele
13Lee, J.H. ; Clarke, R.I. ; Perti, A.: Empirical evaluation of metadata for video games and interactive media.
In: Journal of the Association for Information Science and Technology. 66(2015) no.12, S.2609-2625.
Abstract: Despite increasing interest in and acknowledgment of the significance of video games, current descriptive practices are not sufficiently robust to support searching, browsing, and other access behaviors from diverse user groups. To address this issue, the Game Metadata Research Group at the University of Washington Information School, in collaboration with the Seattle Interactive Media Museum, worked to create a standardized metadata schema. This metadata schema was empirically evaluated using multiple approaches-collaborative review, schema testing, semi-structured user interview, and a large-scale survey. Reviewing and testing the schema revealed issues and challenges in sourcing the metadata for particular elements, determining the level of granularity for data description, and describing digitally distributed games. The findings from user studies suggest that users value various subject and visual metadata, information about how games are related to each other, and data regarding game expansions/alterations such as additional content and networked features. The metadata schema was extensively revised based on the evaluation results, and we present the new element definitions from the revised schema in this article. This work will serve as a platform and catalyst for advances in the design and use of video game metadata.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23357/abstract.
Behandelte Form: Spiele ; Videos
14Lee, J. ; Min, J.-K. ; Oh, A. ; Chung, C.-W.: Effective ranking and search techniques for Web resources considering semantic relationships.
In: Information processing and management. 50(2014) no.1, S.132-155.
Abstract: On the Semantic Web, the types of resources and the semantic relationships between resources are defined in an ontology. By using that information, the accuracy of information retrieval can be improved. In this paper, we present effective ranking and search techniques considering the semantic relationships in an ontology. Our technique retrieves top-k resources which are the most relevant to query keywords through the semantic relationships. To do this, we propose a weighting measure for the semantic relationship. Based on this measure, we propose a novel ranking method which considers the number of meaningful semantic relationships between a resource and keywords as well as the coverage and discriminating power of keywords. In order to improve the efficiency of the search, we prune the unnecessary search space using the length and weight thresholds of the semantic relationship path. In addition, we exploit Threshold Algorithm based on an extended inverted index to answer top-k results efficiently. The experimental results using real data sets demonstrate that our retrieval method using the semantic information generates accurate results efficiently compared to the traditional methods.
Inhalt: Vgl.: doi: 10.1016/j.ipm.2013.08.007. A short preliminary version of this paper was published in the proceeding of WWW 2009 as a two page poster paper.
Themenfeld: Wissensrepräsentation ; Retrievalalgorithmen
15Lee, J.-T. ; Seo, J. ; Jeon, J. ; Rim, H.-C.: Sentence-based relevance flow analysis for high accuracy retrieval.
In: Journal of the American Society for Information Science and Technology. 62(2011) no.9, S.1666-1675.
Abstract: Traditional ranking models for information retrieval lack the ability to make a clear distinction between relevant and nonrelevant documents at top ranks if both have similar bag-of-words representations with regard to a user query. We aim to go beyond the bag-of-words approach to document ranking in a new perspective, by representing each document as a sequence of sentences. We begin with an assumption that relevant documents are distinguishable from nonrelevant ones by sequential patterns of relevance degrees of sentences to a query. We introduce the notion of relevance flow, which refers to a stream of sentence-query relevance within a document. We then present a framework to learn a function for ranking documents effectively based on various features extracted from their relevance flows and leverage the output to enhance existing retrieval models. We validate the effectiveness of our approach by performing a number of retrieval experiments on three standard test collections, each comprising a different type of document: news articles, medical references, and blog posts. Experimental results demonstrate that the proposed approach can improve the retrieval performance at the top ranks significantly as compared with the state-of-the-art retrieval models regardless of document type.
16Lee, J.H.: Analysis of user needs and information features in natural language queries seeking music information.
In: Journal of the American Society for Information Science and Technology. 61(2010) no.5, S.1025-1045.
Abstract: Our limited understanding of real-life queries is an obstacle in developing music information retrieval (MIR) systems that meet the needs of real users. This study aimed, by an empirical investigation of real-life queries, to contribute to developing a theorized understanding of how users seek music information. This is crucial for informing the design of future MIR systems, especially the selection of potential access points, as well as establishing a set of test queries that reflect real-life music information-seeking behavior. Natural language music queries were collected from an online reference Website and coded using content analysis. A taxonomy of user needs expressed and information features used in queries were established by an iterative coding process. This study found that most of the queries analyzed were known-item searches, and most contained a wide variety of kinds of information, although a few features were used much more heavily than the others. In addition to advancing our understanding of real-life user queries by establishing an improved taxonomy of needs and features, three recommendations were made for improving the evaluation of MIR systems: (i) incorporating user context in test queries, (ii) employing terms familiar to users in evaluation tasks, and (iii) combining multiple task results.
17Lee, J.-H. ; Park, S. ; Ahn, C.-M. ; Kim, D.: Automatic generic document summarization based on non-negative matrix factorization.
In: Information processing and management. 45(2009) no.1, S.20-34.
Abstract: In existing unsupervised methods, Latent Semantic Analysis (LSA) is used for sentence selection. However, the obtained results are less meaningful, because singular vectors are used as the bases for sentence selection from given documents, and singular vector components can have negative values. We propose a new unsupervised method using Non-negative Matrix Factorization (NMF) to select sentences for automatic generic document summarization. The proposed method uses non-negative constraints, which are more similar to the human cognition process. As a result, the method selects more meaningful sentences for generic document summarization than those selected using LSA.
Themenfeld: Automatisches Abstracting
Objekt: Latent semantic analysis
18Kang, I.-S. ; Na, S.-H. ; Lee, S. ; Jung, H. ; Kim, P. ; Sung, W.-K. ; Lee, J.-H.: On co-authorship for author disambiguation.
In: Information processing and management. 45(2009) no.1, S.84-97.
Abstract: Author name disambiguation deals with clustering the same-name authors into different individuals. To attack the problem, many studies have employed a variety of disambiguation features such as coauthors, titles of papers/publications, topics of articles, emails/affiliations, etc. Among these, co-authorship is the most easily accessible and influential, since inter-person acquaintances represented by co-authorship could discriminate the identities of authors more clearly than other features. This study attempts to explore the net effects of co-authorship on author clustering in bibliographic data. First, to handle the shortage of explicit coauthors listed in known citations, a web-assisted technique of acquiring implicit coauthors of the target author to be disambiguated is proposed. Then, a coauthor disambiguation hypothesis that the identity of an author can be determined by his/her coauthors is examined and confirmed through a variety of author disambiguation experiments.
19Ding, Y. ; Yan, E. ; Frazho, A. ; Caverlee, J.: PageRank for ranking authors in co-citation networks.
In: Journal of the American Society for Information Science and Technology. 60(2009) no.11, S.2229-2243.
Abstract: This paper studies how varied damping factors in the PageRank algorithm influence the ranking of authors and proposes weighted PageRank algorithms. We selected the 108 most highly cited authors in the information retrieval (IR) area from the 1970s to 2008 to form the author co-citation network. We calculated the ranks of these 108 authors based on PageRank with the damping factor ranging from 0.05 to 0.95. In order to test the relationship between different measures, we compared PageRank and weighted PageRank results with the citation ranking, h-index, and centrality measures. We found that in our author co-citation network, citation rank is highly correlated with PageRank with different damping factors and also with different weighted PageRank algorithms; citation rank and PageRank are not significantly correlated with centrality measures; and h-index rank does not significantly correlate with centrality measures but does significantly correlate with other measures. The key factors that have impact on the PageRank of authors in the author co-citation network are being co-cited with important authors.
Themenfeld: Suchmaschinen ; Retrievalalgorithmen ; Informetrie
20Kim, P.J. ; Lee, J.Y. ; Park, J.-H.: Developing a new collection-evaluation method : mapping and the user-side h-index.
In: Journal of the American Society for Information Science and Technology. 60(2009) no.11, S.2366-2377.
Abstract: This study proposes a new visualization method and index for collection evaluation. Specifically, it develops a network-based mapping technique and a user-focused Hirsch index (user-side h-index) given the lack of previous studies on collection evaluation methods that have used the h-index. A user-side h-index is developed and compared with previous indices (use factor, difference of percentages, collection-side h-index) that represent the strengths of the subject classes of a library collection. The mapping procedure includes the subject-usage profiling of 63 subject classes and collection-usage map generations through the pathfinder network algorithm. Cluster analyses are then conducted upon the pathfinder network to generate 5 large and 14 small clusters. The nodes represent the strengths of the subject-class usages reflected by the user-side h-index. The user-side h-index was found to have advantages (e.g., better demonstrating the real utility of each subject class) over the other indices. It also can more clearly distinguish the strengths between the subject classes than can collection-side h-index. These results may help to identify actual usage and strengths of subject classes in library collections through visualized maps. This may be a useful rationale for the establishment of the collection-development plan.