Literatur zur Informationserschließung
Diese Datenbank enthält über 40.000 Dokumente zu Themen aus den Bereichen Formalerschließung – Inhaltserschließung – Information Retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft
/
Powered by litecat, BIS Oldenburg
(Stand: 28. April 2022)
Suche
Suchergebnisse
Treffer 1–11 von 11
sortiert nach:
-
1Ren, P. ; Chen, Z. ; Ma, J. ; Zhang, Z. ; Si, L. ; Wang, S.: Detecting temporal patterns of user queries.
In: Journal of the Association for Information Science and Technology. 68(2017) no.1, S.113-128.
Abstract: Query classification is an important part of exploring the characteristics of web queries. Existing studies are mainly based on Broder's classification scheme and classify user queries into navigational, informational, and transactional categories according to users' information needs. In this article, we present a novel classification scheme from the perspective of queries' temporal patterns. Queries' temporal patterns are inherent time series patterns of the search volumes of queries that reflect the evolution of the popularity of a query over time. By analyzing the temporal patterns of queries, search engines can more deeply understand the users' search intents and thus improve performance. Furthermore, we extract three groups of features based on the queries' search volume time series and use a support vector machine (SVM) to automatically detect the temporal patterns of user queries. Extensive experiments on the Million Query Track data sets of the Text REtrieval Conference (TREC) demonstrate the effectiveness of our approach.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23578/full.
Themenfeld: Suchtaktik
-
2Cetintas, S. ; Si, L.: Effective query generation and postprocessing strategies for prior art patent search.
In: Journal of the American Society for Information Science and Technology. 63(2012) no.3, S.512-527.
Abstract: Rapid increase in global competition demands increased protection of intellectual property rights and underlines the importance of patents as major intellectual property documents. Prior art patent search is the task of identifying related patents for a given patent file, and is an essential step in judging the validity of a patent application. This article proposes an automated query generation and postprocessing method for prior art patent search. The proposed approach first constructs structured queries by combining terms extracted from different fields of a query patent and then reranks the retrieved patents by utilizing the International Patent Classification (IPC) code similarities between the query patent and the retrieved patents along with the retrieval score. An extensive set of empirical results carried out on a large-scale, real-world dataset shows that utilizing 20 or 30 query terms extracted from all fields of an original query patent according to their log(tf)idf values helps form a representative search query out of the query patent and is found to be more effective than is using any number of query terms from any single field. It is shown that combining terms extracted from different fields of the query patent by giving higher importance to terms extracted from the abstract, claims, and description fields than to terms extracted from the title field is more effective than treating all extracted terms equally while forming the search query. Finally, utilizing the similarities between the IPC codes of the query patent and retrieved patents is shown to be beneficial to improve the effectiveness of the prior art search.
Wissenschaftsfach: Patentinformation
-
3Si, L.E. ; O'Brien, A. ; Probets, S.: Integration of distributed terminology resources to facilitate subject cross-browsing for library portal systems.
In: Aslib proceedings. 62(2010) nos.4/5, S.415-427.
Abstract: Purpose - The paper aims to develop a prototype middleware framework between different terminology resources in order to provide a subject cross-browsing service for library portal systems. Design/methodology/approach - Nine terminology experts were interviewed to collect appropriate knowledge to support the development of a theoretical framework for the research. Based on this, a simplified software-based prototype system was constructed incorporating the knowledge acquired. The prototype involved mappings between the computer science schedule of the Dewey Decimal Classification (which acted as a spine) and two controlled vocabularies, UKAT and ACM Computing Classification. Subsequently, six further experts in the field were invited to evaluate the prototype system and provide feedback to improve the framework. Findings - The major findings showed that, given the large variety of terminology resources distributed throughout the web, the proposed middleware service is essential to integrate technically and semantically the different terminology resources in order to facilitate subject cross-browsing. A set of recommendations are also made, outlining the important approaches and features that support such a cross-browsing middleware service. Originality/value - Cross-browsing features are lacking in current library portal meta-search systems. Users are therefore deprived of this valuable retrieval provision. This research investigated the case for such a system and developed a prototype to fill this gap.
Anmerkung: Beitrag in einem Special Issue: Content architecture: exploiting and managing diverse resources: proceedings of the first national conference of the United Kingdom chapter of the International Society for Knowedge Organization (ISKO)
Themenfeld: Semantische Interoperabilität
Wissenschaftsfach: Informatik
Objekt: DDC ; UKAT ; ACM Computing Classification
-
4Si, L.E. ; O'Brien, A. ; Probets, S.: Integration of distributed terminology resources to facilitate subject cross-browsing for library portal systems.
In: http://www.iskouk.org/conf2009/papers/si_ISKOUK2009.pdf.
Abstract: Purpose: To develop a prototype middleware framework between different terminology resources in order to provide a subject cross-browsing service for library portal systems. Design/methodology/approach: Nine terminology experts were interviewed to collect appropriate knowledge to support the development of a theoretical framework for the research. Based on this, a simplified software-based prototype system was constructed incorporating the knowledge acquired. The prototype involved mappings between the computer science schedule of the Dewey Decimal Classification (which acted as a spine) and two controlled vocabularies UKAT and ACM Computing Classification. Subsequently, six further experts in the field were invited to evaluate the prototype system and provide feedback to improve the framework. Findings: The major findings showed that given the large variety of terminology resources distributed on the web, the proposed middleware service is essential to integrate technically and semantically the different terminology resources in order to facilitate subject cross-browsing. A set of recommendations are also made outlining the important approaches and features that support such a cross browsing middleware service.
Inhalt: This paper is a pre-print version presented at the ISKO UK 2009 conference, 22-23 June, prior to peer review and editing. For published proceedings see special issue of Aslib Proceedings journal.
Themenfeld: Semantische Interoperabilität
Objekt: DDC ; UKAT ; ACM Computing Classification
-
5Rizzolatti, G. ; Fogassi, L. ; Gallese, V.: Spiegel im Gehirn.
In: Spektrum der Wissenschaft. 2007, H.3, S.48-55.
(Spiegelneuronen)
Abstract: Die Entdecker der Spiegelneuronen schildern, wie sie diese bemerkenswerten Hirnzellen aufspürten. sie einen Überblick über die neuere Forschung zu dem Phänomen, die Regungen anderer intuitiv zu verstehen.
Wissenschaftsfach: Kognitionswissenschaft
-
6Si, L.: Encoding formats and consideration of requirements for mapping.
In: http://www.comp.glam.ac.uk/pages/research/hypermedia/nkos/nkos2007/programme.html.
Abstract: With the increasing requirement of establishing semantic mappings between different vocabularies, further development of these encoding formats is becoming more and more important. For this reason, four types of knowledge representation formats were assessed:MARC21 for Classification Data in XML, Zthes XML Schema, XTM(XML Topic Map), and SKOS (Simple Knowledge Organisation System). This paper explores the potential of adapting these representation formats to support different semantic mapping methods, and discusses the implication of extending them to represent more complex KOS.
Inhalt: Präsentation während der Veranstaltung "Networked Knowledge Organization Systems and Services: The 6th European Networked Knowledge Organization Systems (NKOS) Workshop, Workshop at the 11th ECDL Conference, Budapest, Hungary, September 21st 2007".
Themenfeld: Semantische Interoperabilität
Objekt: SKOS
-
7Avrahami, T.T. ; Yau, L. ; Si, L. ; Callan, J.P.: ¬The FedLemur project : Federated search in the real world.
In: Journal of the American Society for Information Science and Technology. 57(2006) no.3, S.347-358.
Abstract: Federated search and distributed information retrieval systems provide a single user interface for searching multiple full-text search engines. They have been an active area of research for more than a decade, but in spite of their success as a research topic, they are still rare in operational environments. This article discusses a prototype federated search system developed for the U.S. government's FedStats Web portal, and the issues addressed in adapting research solutions to this operational environment. A series of experiments explore how well prior research results, parameter settings, and heuristics apply in the FedStats environment. The article concludes with a set of lessons learned from this technology transfer effort, including observations about search engine quality in the real world.
Themenfeld: Verteilte bibliographische Datenbanken
-
8Si, L.: ¬The status quo and future development of cataloging and classification education in China.
In: Cataloging and classification quarterly. 41(2005) no.2, S.85-103.
Abstract: This article depicts the status quo of cataloging and classification education in China, including the library science programs, their curricula, the degrees offered, the contents of courses, and the selection of textbooks. It also analyzes the current problems in library science programs and projects the possible improvements and progress in the teaching in the next five to ten years.
Anmerkung: Beitrag eines Themenheftes "Education for cataloging: international perspectives. Part I"
Themenfeld: Ausbildung
Land/Ort: China
-
9Mailho-Daboussi, L.: Creation d'un thesaurus images : l'experience de la banque de données Iconos sur les fonds de photographies.
In: Documentaliste. 32(1995) no.2, S.99-105.
Abstract: Describes the development of ICONOS, the Documentation Francaise's database for tracking and describing publicly accessible collections of photographs in France. A thesaurus specific to the database was constructed to rationalize and standardise previous work with several different indexing tools. The stages of the thesaurus design included creating a body of descriptors based on those already in use; statistical and qualitative assessment of indexing terms; surmounting semantic and linguistic problems and finally the collection of terminology into a multidisciplinary thesaurus
Behandelte Form: Bilder
Objekt: ICONOS
-
10Nubila, B. di ; Gagliardi, I. ; Macchi, D. ; Milanesi, L. ; Padula, M. ; Pagani, R.: Concept-based indexing and retrieval of multimedia documents.
In: Journal of information science. 20(1994) no.3, S.185-196.
Abstract: In this work, we face the problem of multimedia document indexing with reference to a specific application field, the radiological ward where automatic information managemenmt by content is an urgent need. Here, a multimedia document is composed of text and images. The keystone of the approach is the image indexing which is performed in an indirect way: the description of the image (made by an expert, in our case a physician) is further synthesised and formalised to be used by the computer. In this paper, we propose a concept-based indexing of the description of the images which is based on Farradane's work. The basic proposal has been extended to deal with specific requirements of the considered application and to be automatically performed. A first prototype of a multimedia information retrieval system has been implemented with the goal of validating the method in the specific application
Behandelte Form: Bilder
-
11Cassi, L.: Relational tools to manage pictorial information.
In: Journal of information science. 18(1992) no.5, S.375-398.
Abstract: Dafine a data model to organize and manage the indexing structure of a pictorial information retrieval system. Describes relational tools to manage the information drawn from digital images. Such tools help to organize data according to the numerical/structural user model adopted to describe the images. They allow the storage, manipulation and management of complex pictorial object descriptions which take into account objects, sub-objetcs and their composition relationships, as well as the specialization relationship among their type
Behandelte Form: Bilder