Diese Datenbank enthält über 40.000 Dokumente zu Themen aus den Bereichen Formalerschließung – Inhaltserschließung – Information Retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft / Powered by litecat, BIS Oldenburg (Stand: 28. April 2022)
1Lin, Y.-R. ; Margolin, D. ; Lazer, D.: Uncovering social semantics from textual traces : a theory-driven approach and evidence from public statements of U.S. Members of Congress.
In: Journal of the Association for Information Science and Technology. 67(2016) no.9, S.2072-2089.
Abstract: The increasing abundance of digital textual archives provides an opportunity for understanding human social systems. Yet the literature has not adequately considered the disparate social processes by which texts are produced. Drawing on communication theory, we identify three common processes by which documents might be detectably similar in their textual features-authors sharing subject matter, sharing goals, and sharing sources. We hypothesize that these processes produce distinct, detectable relationships between authors in different kinds of textual overlap. We develop a novel n-gram extraction technique to capture such signatures based on n-grams of different lengths. We test the hypothesis on a corpus where the author attributes are observable: the public statements of the members of the U.S. Congress. This article presents the first empirical finding that shows different social relationships are detectable through the structure of overlapping textual features. Our study has important implications for designing text modeling techniques to make sense of social phenomena from aggregate digital traces.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23540/full.
2Cao, N. ; Sun, J. ; Lin, Y.-R. ; Gotz, D. ; Liu, S. ; Qu, H.: FacetAtlas : Multifaceted visualization for rich text corpora.
In: IEEE Transactions on Visualization and Computer Graphics. InfoVis 2010. [http://systemg.research.ibm.com/apps/facetatlas/cao_infovis10_paper.pdf].
Abstract: Documents in rich text corpora usually contain multiple facets of information. For example, an article about a specific disease often consists of different facets such as symptom, treatment, cause, diagnosis, prognosis, and prevention. Thus, documents may have different relations based on different facets. Powerful search tools have been developed to help users locate lists of individual documents that are most related to specific keywords. However, there is a lack of effective analysis tools that reveal the multifaceted relations of documents within or cross the document clusters. In this paper, we present FacetAtlas, a multifaceted visualization technique for visually analyzing rich text corpora. FacetAtlas combines search technology with advanced visual analytical tools to convey both global and local patterns simultaneously. We describe several unique aspects of FacetAtlas, including (1) node cliques and multifaceted edges, (2) an optimized density map, and (3) automated opacity pattern enhancement for highlighting visual patterns, (4) interactive context switch between facets. In addition, we demonstrate the power of FacetAtlas through a case study that targets patient education in the health care domain. Our evaluation shows the benefits of this work, especially in support of complex multifaceted data analysis.
Inhalt: Vgl. auch: FacetAtlas: Visualizing multifaceted text documents as graphs. Unter: http://systemg.research.ibm.com/apps/facetatlas/index.html.
Themenfeld: Visualisierung ; Wissensrepräsentation ; Semantisches Umfeld in Indexierung u. Retrieval
Objekt: FacetAtlas ; InfoVis