Diese Datenbank enthält über 40.000 Dokumente zu Themen aus den Bereichen Formalerschließung – Inhaltserschließung – Information Retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft / Powered by litecat, BIS Oldenburg (Stand: 03. März 2020)
1Poibeau, T. u.a. (Hrsg.): Multi-source, multilingual information extraction and summarization.
Berlin : Springer, 2013. XX, 323 S.
(Theory and applications of natural language processing)
Abstract: Information extraction (IE) and text summarization (TS) are powerful technologies for finding relevant pieces of information in text and presenting them to the user in condensed form. The ongoing information explosion makes IE and TS critical for successful functioning within the information society. These technologies face particular challenges due to the inherent multi-source nature of the information explosion. The technologies must now handle not isolated texts or individual narratives, but rather large-scale repositories and streams---in general, in multiple languages---containing a multiplicity of perspectives, opinions, or commentaries on particular topics, entities or events. There is thus a need to adapt existing techniques and develop new ones to deal with these challenges. This volume contains a selection of papers that present a variety of methodologies for content identification and extraction, as well as for content fusion and regeneration. The chapters cover various aspects of the challenges, depending on the nature of the information sought---names vs. events,--- and the nature of the sources---news streams vs. image captions vs. scientific research papers, etc. This volume aims to offer a broad and representative sample of studies from this very active research field.
Anmerkung: Rez. in: JASIST 64(2013) no.7, S.1519-1521 (José L. Vicedo, David Tomás)
RSWK: Natürlichsprachiges System / Information Extraction / Automatische Inhaltsanalyse / Zusammenfassung / Aufsatzsammlung
BK: 54.75 (Sprachverarbeitung)
DDC: 006.312 / DDC22ger ; 005.74 / DDC22ger
RVK: ST 530 ; ST 306 ; AN 95300
2Liu, B.: Web data mining : exploring hyperlinks, contents, and usage data.2nd ed.
Heidelberg : Springer, 2011. XX, 622 S.
(Data-centric systems and applications)
Abstract: Web mining aims to discover useful information and knowledge from the Web hyperlink structure, page contents, and usage data. Although Web mining uses many conventional data mining techniques, it is not purely an application of traditional data mining due to the semistructured and unstructured nature of the Web data and its heterogeneity. It has also developed many of its own algorithms and techniques. Liu has written a comprehensive text on Web data mining. Key topics of structure mining, content mining, and usage mining are covered both in breadth and in depth. His book brings together all the essential concepts and algorithms from related areas such as data mining, machine learning, and text processing to form an authoritative and coherent text. The book offers a rich blend of theory and practice, addressing seminal research ideas, as well as examining the technology from a practical point of view. It is suitable for students, researchers and practitioners interested in Web mining both as a learning text and a reference book. Lecturers can readily use it for classes on data mining, Web mining, and Web search. Additional teaching materials such as lecture slides, datasets, and implemented algorithms are available online.
Inhalt: Inhalt: 1. Introduction 2. Association Rules and Sequential Patterns 3. Supervised Learning 4. Unsupervised Learning 5. Partially Supervised Learning 6. Information Retrieval and Web Search 7. Social Network Analysis 8. Web Crawling 9. Structured Data Extraction: Wrapper Generation 10. Information Integration
Anmerkung: Elektronische Ausgabe unter: http://springer.r.delivery.net/r/r?2.1.Ee.2Tp.1gd0L5.C3WE8i..N.WdtE.3uq2.bW89MQ%5f%5fCXPUFOH0.
Themenfeld: Data Mining
RSWK: World Wide Web / Data Mining
BK: 54.72 ; 06.74 ; 06.70 ; 54.32
DDC: 006.312 / DDC22ger ; 005.7402854678 / DDC22ger ; 005.72 / DDC22ger
GHBS: TZG (FH K) ; TWX (FH GE)
RVK: ST 530