Search (243 results, page 1 of 13)

Multilingual information management : current levels and future abilities. A report Commissioned by the US National Science Foundation and also delivered to the European Commission's Language Engineering Office and the US Defense Advanced Research Projects Agency, April 1999 (1999) 0.03
```
0.0329042 = product of:
  0.09871259 = sum of:
    0.016146496 = weight(_text_:web in 6068) [ClassicSimilarity], result of:
      0.016146496 = score(doc=6068,freq=2.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.14422815 = fieldWeight in 6068, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03125 = fieldNorm(doc=6068)
    0.014015876 = weight(_text_:information in 6068) [ClassicSimilarity], result of:
      0.014015876 = score(doc=6068,freq=18.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.23274568 = fieldWeight in 6068, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03125 = fieldNorm(doc=6068)
    0.053511675 = weight(_text_:extraction in 6068) [ClassicSimilarity], result of:
      0.053511675 = score(doc=6068,freq=2.0), product of:
        0.20380433 = queryWeight, product of:
          5.941145 = idf(docFreq=315, maxDocs=44218)
          0.03430388 = queryNorm
        0.26256397 = fieldWeight in 6068, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.941145 = idf(docFreq=315, maxDocs=44218)
          0.03125 = fieldNorm(doc=6068)
    0.015038553 = weight(_text_:system in 6068) [ClassicSimilarity], result of:
      0.015038553 = score(doc=6068,freq=2.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.13919188 = fieldWeight in 6068, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03125 = fieldNorm(doc=6068)
  0.33333334 = coord(4/12)
```
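
The tree above follows the TF-IDF arithmetic of Lucene's ClassicSimilarity. As a minimal sketch, assuming the usual ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), the following Python recomputes the score for document 6068. The function names are illustrative and not part of Lucene; queryNorm and fieldNorm are copied from the output rather than derived.

```python
import math

def tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)

def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    """queryWeight * fieldWeight for one matching term."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight

# Constants copied from the explanation for doc 6068.
MAX_DOCS = 44218
QUERY_NORM = 0.03430388
FIELD_NORM = 0.03125

# (term, termFreq, docFreq) for the four matching clauses.
MATCHES = [
    ("web", 2.0, 4597),
    ("information", 18.0, 20772),
    ("extraction", 2.0, 315),
    ("system", 2.0, 5152),
]

term_scores = {
    term: term_score(freq, doc_freq, MAX_DOCS, QUERY_NORM, FIELD_NORM)
    for term, freq, doc_freq in MATCHES
}
coord = 4 / 12  # 4 of the 12 query clauses matched
total = coord * sum(term_scores.values())

for term, score in term_scores.items():
    print(f"{term:12s} {score:.9f}")
print(f"total        {total:.7f}")
```

The per-term values and the total should agree with the figures in the explanation up to single-precision rounding, since Lucene computes these factors as 32-bit floats while Python uses doubles.
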
Abstract

Over the past 50 years, a variety of language-related capabilities has been developed in machine translation, information retrieval, speech recognition, text summarization, and so on. These applications rest upon a set of core techniques such as language modeling, information extraction, parsing, generation, and multimedia planning and integration; and they involve methods using statistics, rules, grammars, lexicons, ontologies, training techniques, and so on. It is a puzzling fact that although all of this work deals with language in some form or other, the major applications have each developed a separate research field. For example, there is no reason why speech recognition techniques involving n-grams and hidden Markov models could not have been used in machine translation 15 years earlier than they were, or why some of the lexical and semantic insights from the subarea called Computational Linguistics are still not used in information retrieval.
This picture will rapidly change. The twin challenges of massive information overload via the web and ubiquitous computers present us with an unavoidable task: developing techniques to handle multilingual and multi-modal information robustly and efficiently, with as high quality performance as possible. The most effective way for us to address such a mammoth task, and to ensure that our various techniques and applications fit together, is to start talking across the artificial research boundaries. Extending the current technologies will require integrating the various capabilities into multi-functional and multi-lingual natural language systems. However, at this time there is no clear vision of how these technologies could or should be assembled into a coherent framework. What would be involved in connecting a speech recognition system to an information retrieval engine, and then using machine translation and summarization software to process the retrieved text? How can traditional parsing and generation be enhanced with statistical techniques? What would be the effect of carefully crafted lexicons on traditional information retrieval? At which points should machine translation be interleaved within information retrieval systems to enable multilingual processing?

Bian, G.-W.; Chen, H.-H.: Cross-language information access to multilingual collections on the Internet (2000) 0.03

0.027630312 = product of:
  0.082890935 = sum of:
    0.03425189 = weight(_text_:web in 4436) [ClassicSimilarity], result of:
      0.03425189 = score(doc=4436,freq=4.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.3059541 = fieldWeight in 4436, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=4436)
    0.012138106 = weight(_text_:information in 4436) [ClassicSimilarity], result of:
      0.012138106 = score(doc=4436,freq=6.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.20156369 = fieldWeight in 4436, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=4436)
    0.02255783 = weight(_text_:system in 4436) [ClassicSimilarity], result of:
      0.02255783 = score(doc=4436,freq=2.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.20878783 = fieldWeight in 4436, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.046875 = fieldNorm(doc=4436)
    0.013943106 = product of:
      0.027886212 = sum of:
        0.027886212 = weight(_text_:22 in 4436) [ClassicSimilarity], result of:
          0.027886212 = score(doc=4436,freq=2.0), product of:
            0.120126344 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03430388 = queryNorm
            0.23214069 = fieldWeight in 4436, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=4436)
      0.5 = coord(1/2)
  0.33333334 = coord(4/12)
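
The explanation for document 4436 contains a nested clause (the term `22`) that carries its own coord(1/2) factor before the top-level coord(4/12) is applied. A small sketch of that arithmetic, assuming the tree is read as nested sum and product nodes: the tuple representation and the `evaluate` helper are hypothetical, and the leaf values are copied from the output.

```python
from math import prod  # Python 3.8+

def evaluate(node):
    """Evaluate a nested ("sum" | "product", children) tree; leaves are numbers."""
    if isinstance(node, (int, float)):
        return float(node)
    op, children = node
    values = [evaluate(child) for child in children]
    if op == "sum":
        return sum(values)
    if op == "product":
        return prod(values)
    raise ValueError(f"unknown node type: {op}")

# Leaf values copied from the explanation for doc 4436.
doc_4436 = ("product", [
    ("sum", [
        0.03425189,    # weight(_text_:web)
        0.012138106,   # weight(_text_:information)
        0.02255783,    # weight(_text_:system)
        ("product", [  # nested clause with its own coord factor
            ("sum", [0.027886212]),  # weight(_text_:22)
            0.5,                     # coord(1/2)
        ]),
    ]),
    4 / 12,  # top-level coord(4/12)
])

nested = evaluate(doc_4436[1][0][1][3])
total = evaluate(doc_4436)
print(f"nested clause: {nested:.9f}")
print(f"total score:   {total:.9f}")
```

The nested clause contributes half of its raw weight to the outer sum, and the outer sum is then scaled by one third.
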

Abstract: The language barrier is the major problem that people face in searching for, retrieving, and understanding multilingual collections on the Internet. This paper deals with query translation and document translation in a Chinese-English information retrieval system called MTIR. Bilingual dictionary and monolingual corpus-based approaches are adopted to select suitable translated query terms. A machine transliteration algorithm is introduced to resolve proper name searching. We consider several design issues for document translation, including which material is translated, what roles the HTML tags play in translation, what the tradeoff is between the speed performance and the translation performance, and what form the translated result is presented in. About 100,000 Web pages translated in the last 4 months of 1997 are used for a quantitative study of online and real-time Web page translation.
Date: 16. 2.2000 14:22:39
Source: Journal of the American Society for Information Science. 51(2000) no.3, S.281-296

Cao, L.; Leong, M.-K.; Low, H.-B.: Searching heterogeneous multilingual bibliographic sources (1998) 0.03

0.026698861 = product of:
  0.106795445 = sum of:
    0.045669187 = weight(_text_:web in 3564) [ClassicSimilarity], result of:
      0.045669187 = score(doc=3564,freq=4.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.4079388 = fieldWeight in 3564, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0625 = fieldNorm(doc=3564)
    0.04253545 = weight(_text_:system in 3564) [ClassicSimilarity], result of:
      0.04253545 = score(doc=3564,freq=4.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.3936941 = fieldWeight in 3564, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.0625 = fieldNorm(doc=3564)
    0.01859081 = product of:
      0.03718162 = sum of:
        0.03718162 = weight(_text_:22 in 3564) [ClassicSimilarity], result of:
          0.03718162 = score(doc=3564,freq=2.0), product of:
            0.120126344 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03430388 = queryNorm
            0.30952093 = fieldWeight in 3564, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=3564)
      0.5 = coord(1/2)
  0.25 = coord(3/12)
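
Document 3564 has a larger raw sum than document 4436 (0.106795445 against 0.082890935) yet ranks below it, because fewer of the 12 query clauses matched. The sketch below illustrates this effect of the coord factor; the tuple layout and the `final_score` helper are hypothetical, and the numbers are copied from the two explanations.

```python
# (doc id, sum of clause scores, matched clauses, total clauses),
# copied from the explanations for docs 4436 and 3564.
results = [
    (4436, 0.082890935, 4, 12),
    (3564, 0.106795445, 3, 12),
]

def final_score(raw_sum, matched, total_clauses):
    """Scale the raw sum by the fraction of query clauses that matched."""
    return raw_sum * (matched / total_clauses)

# Ranking by raw sum alone versus ranking after the coord factor.
by_raw = sorted(results, key=lambda r: r[1], reverse=True)
by_final = sorted(results, key=lambda r: final_score(*r[1:]), reverse=True)

print("by raw sum:    ", [doc_id for doc_id, *_ in by_raw])
print("by final score:", [doc_id for doc_id, *_ in by_final])
```

The two orderings differ: the coord factor rewards documents that match more of the query's clauses, even when their individual term weights are lower.
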

Abstract: Proposes a Web-based architecture for searching distributed heterogeneous multi-Asian language bibliographic sources, and describes a successful pilot implementation of the system, the Chinese Library (CLib) system, developed in Singapore and tested at 2 university libraries and a public library.
Date: 1. 8.1996 22:08:06
Footnote: Contribution to a special issue devoted to the Proceedings of the 7th International World Wide Web Conference, held 14-18 April 1998, Brisbane, Australia

Wang, F.L.; Yang, C.C.: The impact analysis of language differences on an automatic multilingual text summarization system (2006) 0.02

0.024341919 = product of:
  0.097367674 = sum of:
    0.011679897 = weight(_text_:information in 5049) [ClassicSimilarity], result of:
      0.011679897 = score(doc=5049,freq=8.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.19395474 = fieldWeight in 5049, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5049)
    0.06688959 = weight(_text_:extraction in 5049) [ClassicSimilarity], result of:
      0.06688959 = score(doc=5049,freq=2.0), product of:
        0.20380433 = queryWeight, product of:
          5.941145 = idf(docFreq=315, maxDocs=44218)
          0.03430388 = queryNorm
        0.32820496 = fieldWeight in 5049, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.941145 = idf(docFreq=315, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5049)
    0.018798191 = weight(_text_:system in 5049) [ClassicSimilarity], result of:
      0.018798191 = score(doc=5049,freq=2.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.17398985 = fieldWeight in 5049, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5049)
  0.25 = coord(3/12)

Abstract: Based on the salient features of the documents, automatic text summarization systems extract the key sentences from source documents. This process supports the users in evaluating the relevance of the extracted documents returned by information retrieval systems. Because of this tool, efficient filtering can be achieved. Indirectly, these systems help to resolve the problem of information overloading. Many automatic text summarization systems have been implemented for use with different languages. It has been established that the grammatical and lexical differences between languages have a significant effect on text processing. However, the impact of the language differences on the automatic text summarization systems has not yet been investigated. The authors provide an impact analysis of language difference on automatic text summarization. It includes the effect on the extraction processes, the scoring mechanisms, the performance, and the matching of the extracted sentences, using the parallel corpus in English and Chinese as the tested object. The analysis results provide a greater understanding of language differences and promote the future development of more advanced text summarization techniques.
Footnote: Contribution to a special topic section on multilingual information systems
Source: Journal of the American Society for Information Science and Technology. 57(2006) no.5, S.684-696

Ye, Z.; Huang, J.X.; He, B.; Lin, H.: Mining a multilingual association dictionary from Wikipedia for cross-language information retrieval (2012) 0.02

0.02429695 = product of:
  0.0971878 = sum of:
    0.02018312 = weight(_text_:web in 513) [ClassicSimilarity], result of:
      0.02018312 = score(doc=513,freq=2.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.18028519 = fieldWeight in 513, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=513)
    0.010115089 = weight(_text_:information in 513) [ClassicSimilarity], result of:
      0.010115089 = score(doc=513,freq=6.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.16796975 = fieldWeight in 513, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=513)
    0.06688959 = weight(_text_:extraction in 513) [ClassicSimilarity], result of:
      0.06688959 = score(doc=513,freq=2.0), product of:
        0.20380433 = queryWeight, product of:
          5.941145 = idf(docFreq=315, maxDocs=44218)
          0.03430388 = queryNorm
        0.32820496 = fieldWeight in 513, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.941145 = idf(docFreq=315, maxDocs=44218)
          0.0390625 = fieldNorm(doc=513)
  0.25 = coord(3/12)

Abstract: Wikipedia is characterized by its dense link structure and a large number of articles in different languages, which make it a notable Web corpus for knowledge extraction and mining, in particular for mining the multilingual associations. In this paper, motivated by a psychological theory of word meaning, we propose a graph-based approach to constructing a cross-language association dictionary (CLAD) from Wikipedia, which can be used in a variety of cross-language accessing and processing applications. In order to evaluate the quality of the mined CLAD, and to demonstrate how the mined CLAD can be used in practice, we explore two different applications of the mined CLAD to cross-language information retrieval (CLIR). First, we use the mined CLAD to conduct cross-language query expansion; and, second, we use it to filter out translation candidates with low translation probabilities. Experimental results on a variety of standard CLIR test collections show that the CLIR retrieval performance can be substantially improved with the above two applications of CLAD, which indicates that the mined CLAD is of sound quality.
Source: Journal of the American Society for Information Science and Technology. 63(2012) no.12, S.2474-2487

Effektive Information Retrieval Verfahren in Theorie und Praxis : ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005 (2006) 0.02
```
0.023171356 = product of:
  0.069514066 = sum of:
    0.011417297 = weight(_text_:web in 5973) [ClassicSimilarity], result of:
      0.011417297 = score(doc=5973,freq=4.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.1019847 = fieldWeight in 5973, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.015625 = fieldNorm(doc=5973)
    0.011443916 = weight(_text_:information in 5973) [ClassicSimilarity], result of:
      0.011443916 = score(doc=5973,freq=48.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.19003606 = fieldWeight in 5973, product of:
          6.928203 = tf(freq=48.0), with freq of:
            48.0 = termFreq=48.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.015625 = fieldNorm(doc=5973)
    0.02675872 = weight(_text_:suche in 5973) [ClassicSimilarity], result of:
      0.02675872 = score(doc=5973,freq=4.0), product of:
        0.17138755 = queryWeight, product of:
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.03430388 = queryNorm
        0.15612988 = fieldWeight in 5973, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.015625 = fieldNorm(doc=5973)
    0.019894136 = weight(_text_:system in 5973) [ClassicSimilarity], result of:
      0.019894136 = score(doc=5973,freq=14.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.18413356 = fieldWeight in 5973, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.015625 = fieldNorm(doc=5973)
  0.33333334 = coord(4/12)
```
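
In document 5973 the term "information" occurs 48 times, yet its fieldWeight is lower than in document 6068, where it occurs 18 times. The difference comes from fieldNorm. As a rough sketch, assuming the ClassicSimilarity length norm is approximately 1 / sqrt(number of terms in the field) with no index-time boost, the norm can be inverted to estimate field length. Lucene stores the norm with lossy one-byte precision, so the estimates are coarse. The helper names are illustrative only.

```python
import math

def approx_field_length(field_norm: float) -> float:
    """Invert fieldNorm ~= 1 / sqrt(num_terms); a rough estimate only."""
    return 1.0 / (field_norm ** 2)

def field_weight(freq: float, idf: float, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq)."""
    return math.sqrt(freq) * idf * field_norm

IDF_INFORMATION = 1.7554779  # idf(docFreq=20772, maxDocs=44218)

# freq and fieldNorm for the term "information", copied from the explanations.
docs = {
    6068: {"freq": 18.0, "field_norm": 0.03125},
    5973: {"freq": 48.0, "field_norm": 0.015625},
}

for doc_id, d in docs.items():
    length = approx_field_length(d["field_norm"])
    weight = field_weight(d["freq"], IDF_INFORMATION, d["field_norm"])
    print(f"doc {doc_id}: ~{length:.0f} terms, fieldWeight {weight:.8f}")
```

Under this assumption the record for document 5973 is roughly four times as long as the one for document 6068, and the length normalization outweighs the higher term frequency.
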
Abstract

Information retrieval has become a key technology in the knowledge society. The number of queries submitted to Internet search engines every day is only one indicator of the great importance of this topic. The anthology covers topics such as information retrieval fundamentals, retrieval systems, digital libraries, evaluation, and multilingual systems; it describes application scenarios and addresses new challenges facing information retrieval. Several contributions touch on the intensive participation of the information science group at the University of Hildesheim in the Cross Language Evaluation Forum (CLEF), a European evaluation initiative for research on multilingual retrieval systems. Application scenarios and the examination of current, practical questions also play a major role.

Content

Contents:
- Jan-Hendrik Scheufen: RECOIN: Modell offener Schnittstellen für Information-Retrieval-Systeme und -Komponenten
- Markus Nick, Klaus-Dieter Althoff: Designing Maintainable Experience-based Information Systems
- Gesine Quint, Steffen Weichert: Die benutzerzentrierte Entwicklung des Produkt-Retrieval-Systems EIKON der Blaupunkt GmbH
- Claus-Peter Klas, Sascha Kriewel, André Schaefer, Gudrun Fischer: Das DAFFODIL System - Strategische Literaturrecherche in Digitalen Bibliotheken
- Matthias Meiert: Entwicklung eines Modells zur Integration digitaler Dokumente in die Universitätsbibliothek Hildesheim
- Daniel Harbig, René Schneider: Ontology Learning im Rahmen von MyShelf
- Michael Kluck, Marco Winter: Topic-Entwicklung und Relevanzbewertung bei GIRT: ein Werkstattbericht
- Thomas Mandl: Neue Entwicklungen bei den Evaluierungsinitiativen im Information Retrieval
- Joachim Pfister: Clustering von Patent-Dokumenten am Beispiel der Datenbanken des Fachinformationszentrums Karlsruhe
- Ralph Kölle, Glenn Langemeier, Wolfgang Semar: Programmieren lernen in kollaborativen Lernumgebungen
- Olga Tartakovski, Margaryta Shramko: Implementierung eines Werkzeugs zur Sprachidentifikation in mono- und multilingualen Texten
- Nina Kummer: Indexierungstechniken für das japanische Retrieval
- Suriya Na Nhongkai, Hans-Joachim Bentz: Bilinguale Suche mittels Konzeptnetzen
- Robert Strötgen, Thomas Mandl, René Schneider: Entwicklung und Evaluierung eines Question Answering Systems im Rahmen des Cross Language Evaluation Forum (CLEF)
- Niels Jensen: Evaluierung von mehrsprachigem Web-Retrieval: Experimente mit dem EuroGOV-Korpus im Rahmen des Cross Language Evaluation Forum (CLEF)

Footnote

Reviewed in: Information - Wissenschaft und Praxis 57(2006) no.5, pp.290-291 (C. Schindler): "Less than a year after the 'Vierter Hildesheimer Evaluierungs- und Retrievalworkshop' (HIER 2005) in July 2005, the accompanying proceedings volume has appeared. The Hildesheim information science group had issued the invitation in order to present its research results, and those of several external experts, on the topic of information retrieval to a specialist audience and to open them for discussion. Under the title 'Effektive Information Retrieval Verfahren in Theorie und Praxis', nearly all of the workshop contributions are collected in the volume, which comprises 15 contributions. With its focus on information retrieval (IR), the volume presents a subfield of information science that has always been at the center of information science research. Whether because of the growing performance of processors and storage media, the spread of the Internet across national borders, or the steady increase in knowledge production, it is clear that in an increasingly interconnected world, orientation and the finding of documents in large knowledge collections have become a central challenge. The new volume presents current approaches to information retrieval by means of practice-oriented projects and theoretical discussions. The anthology divides the core topic of information retrieval into the areas of retrieval systems, digital library, evaluation, and multilingual systems. The articles in the individual sections are, on the whole, quite heterogeneous and therefore do not overlap in content. However, complete thematic coverage of the different areas is not provided either, which can only be expected to a limited degree when presenting the research results of one institute and its cooperation partners.
Thus, both the structure and the individual contributions show a thematic concentration that reflects the particular profile and distinctiveness of Hildesheim information science in the field of information retrieval. Part of this is its multilingual and interdisciplinary orientation, which focuses on the interfaces between information science, linguistics, and computer science in its practice-oriented and international research.
The first chapter, 'Retrieval Systems', presents various information retrieval systems and discusses methods for designing them. Jan-Hendrik Scheufen introduces RECOIN, a meta-framework for information retrieval research, which is characterized by flexible handling of a wide range of applications and thereby enables centralized logging and control of retrieval processes. This concept of an open, component-based system was implemented as a plug-in for the Java-based open-source platform Eclipse. Markus Nick and Klaus-Dieter Althoff explain, in what is incidentally the only English-language text in the book, the DILLEBIS method for the maintenance of experience-based information systems. They call this approach a Maintainable Experience-based Information System and argue that experience-based systems should be oriented toward this model. Gesine Quint and Steffen Weichert, by contrast, present the user-centered development of the product retrieval system EIKON, which was realized in cooperation with Blaupunkt GmbH. Group-specific interaction options for a car multimedia accessories system were designed in an iterative design cycle. In the second chapter, several authors deal more specifically with the application area 'Digital Library'. Claus-Peter Klas, Sascha Kriewel, André Schaefer, and Gudrun Fischer of the University of Duisburg-Essen present the DAFFODIL system, which uses a large number of tools to provide strategic support for literature searches in digital libraries. In addition, the logging of all events makes it possible to use the system as an evaluation platform.
The paper by Matthias Meiert explains the implementation of electronic publication processes at universities, using the example of theses from the International Information Management degree program at the University of Hildesheim. In addition to the general conditions, it presents both the current state and the target state of scholarly electronic publishing in the form of group-specific recommendations. Daniel Harbig and René Schneider describe two methods for the machine learning of ontologies, applied to the virtual library shelf MyShelf. After evaluating both approaches, the authors argue for a semi-automated method for creating ontologies.
'Evaluation', the topic of the third chapter, is not limited in its breadth to information retrieval but also includes individual aspects of human-computer interaction and e-learning. Michael Kluck and Marco Winter, of the Stiftung Wissenschaft und Politik and the Informationszentrum Sozialwissenschaften, address the influence of the question posed (the topic) on relevance assessment and describe procedures for topic creation that are used at the Cross Language Evaluation Forum (CLEF). In the following paper, Thomas Mandl presents various evaluation initiatives in information retrieval and current developments. Joachim Pfister explains the automated grouping, or clustering, of patent documents in the databases of the Fachinformationszentrum Karlsruhe and evaluates different clustering methods on the basis of user ratings. Ralph Kölle, Glenn Langemeier, and Wolfgang Semar address collaborative learning under the special conditions of programming. They apply the VitaminL system for the synchronous processing of programming tasks and the K-3 metrics system for assessing collaborative work in a course. The current research focus of Hildesheim information science emerges in the fourth chapter, under the heading 'Multilingual Systems'. Most of the contributions to the proceedings are found here. Olga Tartakovski and Margaryta Shramko describe and test the LangIdent system, which identifies the language of monolingual and multilingual texts. Nina Kummer presents the peculiarities of Japanese characters and experimentally compares different indexing techniques.
Suriya Na Nhongkai and Hans-Joachim Bentz present and test a bilingual search based on concept networks, in which the concept structure forms the connecting element between the two text collections. The development and evaluation of a multilingual question answering system within the Cross Language Evaluation Forum (CLEF), which allows concrete questions to be formulated in everyday language, is the subject of the contribution by Robert Strötgen, Thomas Mandl, and René Schneider. The volume closes with the paper by Niels Jensen, who evaluates a multilingual Web retrieval system, likewise in connection with CLEF, using the multilingual EuroGOV corpus.
In conclusion, the proceedings give a successful overview of the information retrieval projects of Hildesheim information science and its cooperation partners. The individual contributions are very stimulating and of a high standard. A small obstacle for the reader is orientation within the volume, both in content and in structure. The relation of the individual articles to the topic of their chapter is briefly explained in the preface. Orientation in the book is made harder, however, by the lack of chapter headings at the beginning of the individual sections. It should also be mentioned that one of the articles bears a different title from the one announced in the table of contents. A reader who overlooks these formal shortcomings is richly rewarded with practice-oriented and theoretically well-founded project descriptions and research results. This is especially so because current topics in information science are not only taken up but also developed further and shaped by the particular interdisciplinary and international conditions in Hildesheim. The various projects show how well Hildesheim information science is integrated into the community of supraregional information institutions and other German information science research groups. With a further opening of the group of experts, the workshop thus has the potential to become an independent institution in the field of information retrieval. In this sense, one can hope for further fruitful workshops and their publications. A next workshop of the University of Hildesheim on information retrieval, organized with the Information Retrieval special interest group of the Gesellschaft für Informatik, is already announced for 9 to 13 October 2006."

Wang, J.-H.; Teng, J.-W.; Lu, W.-H.; Chien, L.-F.: Exploiting the Web as the multilingual corpus for unknown query translation (2006) 0.02

0.023119796 = product of:
  0.092479184 = sum of:
    0.048439488 = weight(_text_:web in 5050) [ClassicSimilarity], result of:
      0.048439488 = score(doc=5050,freq=8.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.43268442 = fieldWeight in 5050, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=5050)
    0.012138106 = weight(_text_:information in 5050) [ClassicSimilarity], result of:
      0.012138106 = score(doc=5050,freq=6.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.20156369 = fieldWeight in 5050, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=5050)
    0.031901587 = weight(_text_:system in 5050) [ClassicSimilarity], result of:
      0.031901587 = score(doc=5050,freq=4.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.29527056 = fieldWeight in 5050, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.046875 = fieldNorm(doc=5050)
  0.25 = coord(3/12)
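
The arithmetic in the explanation tree above can be retraced with a short sketch. It assumes the classic TF-IDF form that the listed numbers are consistent with: tf = sqrt(termFreq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and a final coord factor of matched clauses over total clauses. The function names are hypothetical helpers for illustration, not part of any search engine API; all numeric inputs are copied from the tree.

```python
import math

# Values copied from the explanation tree above.
QUERY_NORM = 0.03430388
MAX_DOCS = 44218


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency, assuming 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm, query_norm=QUERY_NORM):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = term_idf * query_norm                    # idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


def document_score(terms, field_norm, matched, total):
    """Sum the term scores and scale by coord = matched / total query clauses."""
    return (matched / total) * sum(term_score(f, df, field_norm) for f, df in terms)


# (termFreq, docFreq) for "web", "information" and "system" in doc 5050.
terms_5050 = [(8.0, 4597), (6.0, 20772), (4.0, 5152)]
score_5050 = document_score(terms_5050, field_norm=0.046875, matched=3, total=12)
print(f"{score_5050:.7f}")
```

The printed value should agree with the 0.023119796 listed above up to rounding, since the engine reports single-precision numbers while Python computes in double precision.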

Abstract: Users' cross-lingual queries to a digital library system might be short and the query terms may not be included in a common translation dictionary (unknown terms). In this article, the authors investigate the feasibility of exploiting the Web as the multilingual corpus source to translate unknown query terms for cross-language information retrieval in digital libraries. They propose a Web-based term translation approach to determine effective translations for unknown query terms by mining bilingual search-result pages obtained from a real Web search engine. This approach can enhance the construction of a domain-specific bilingual lexicon and bring multilingual support to a digital library that only has monolingual document collections. Very promising results have been obtained in generating effective translation equivalents for many unknown terms, including proper nouns, technical terms, and Web query terms, and in assisting bilingual lexicon construction for a real digital library system.
Footnote: Contribution to a special topic section on multilingual information systems
Source: Journal of the American Society for Information Science and Technology. 57(2006) no.5, pp. 660-670

Mustafa el Hadi, W.: Human language technology and its role in information access and management (2003) 0.02
```
0.022387289 = product of:
  0.13432373 = sum of:
    0.01846754 = weight(_text_:information in 5524) [ClassicSimilarity], result of:
      0.01846754 = score(doc=5524,freq=20.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.30666938 = fieldWeight in 5524, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5524)
    0.115856186 = weight(_text_:extraction in 5524) [ClassicSimilarity], result of:
      0.115856186 = score(doc=5524,freq=6.0), product of:
        0.20380433 = queryWeight, product of:
          5.941145 = idf(docFreq=315, maxDocs=44218)
          0.03430388 = queryNorm
        0.56846774 = fieldWeight in 5524, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          5.941145 = idf(docFreq=315, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5524)
  0.16666667 = coord(2/12)
```
Abstract

The role of linguistics in information access, extraction and dissemination is essential. Radical changes in the techniques of information and communication at the end of the twentieth century have had a significant effect on the function of the linguistic paradigm and its applications in all forms of communication. The introduction of new technical means has deeply changed the possibilities for the distribution of information. In this situation, what is the role of the linguistic paradigm and its practical applications, i.e., natural language processing (NLP) techniques when applied to information access? What solutions can linguistics offer in human-computer interaction, extraction and management? Many fields show the relevance of the linguistic paradigm through the various technologies that require NLP, such as document and message understanding, information detection, extraction, and retrieval, question and answer, cross-language information retrieval (CLIR), text summarization, filtering, and spoken document retrieval. This paper focuses on the central role of human language technologies in the information society, surveys the current situation, describes the benefits of the above-mentioned applications, outlines successes and challenges, and discusses solutions. It reviews the resources and means needed to advance information access and dissemination across language boundaries in the twenty-first century. Multilingualism, which is a natural result of globalization, requires more effort in the direction of language technology. The scope of human language technology (HLT) is large, so we limit our review to applications that involve multilinguality.

Content

Contribution to the special issue "Knowledge organization and classification in international information retrieval"

Ludwig, L.: Lösung zum multilingualen Wissensmanagement semantischer Informationen (2010) 0.02
```
0.021571122 = product of:
  0.08628449 = sum of:
    0.02018312 = weight(_text_:web in 4281) [ClassicSimilarity], result of:
      0.02018312 = score(doc=4281,freq=2.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.18028519 = fieldWeight in 4281, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4281)
    0.047303177 = weight(_text_:suche in 4281) [ClassicSimilarity], result of:
      0.047303177 = score(doc=4281,freq=2.0), product of:
        0.17138755 = queryWeight, product of:
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.03430388 = queryNorm
        0.27600124 = fieldWeight in 4281, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4281)
    0.018798191 = weight(_text_:system in 4281) [ClassicSimilarity], result of:
      0.018798191 = score(doc=4281,freq=2.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.17398985 = fieldWeight in 4281, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4281)
  0.25 = coord(3/12)
```
Abstract

A system for the multilingual knowledge management of semantic information is presented: the Semantic Wiki Artificial Memory. Its basic idea is briefly outlined. Various current problems of technology-supported knowledge management are explained, and a novel combination of innovative and established solutions to these problems is illustrated.

Content

"Until a few years ago, shorter physical documents and books were the preferred means of composing and exchanging thoughts. Document registers helped with finding documents, outlines supported orientation within them, and subject indexes, where available, assisted in picking out passages. This incremental orientation is increasingly giving way to pure keyword search in electronic document corpora, in particular the WWW. Document registers, outlines and subject indexes are being undermined by search engines built on word indexes. The search result points directly to a single text excerpt (snippet). Finding one's way within the document and finding the right documents (or document suggestions) now take place, if at all, in reverse order and can therefore be very laborious. Such a search does succeed at the first attempt, however, when the target document appears entirely tailored to the keyword, that is, when text excerpt, chapter and document virtually coincide. The pull of the search engines breaks up the traditional sequential document structure and ultimately breaks the document itself into ever smaller units suited to search engines. In this way, indexing by individual words ultimately dissolves the document itself. What remains is merely a collection of information units ordered according to the index: the lexicon or the catalogue. In electronically supported knowledge management, the wiki now takes the place of the lexicon, and the named wiki entry takes the place of the document."

Source

Semantic web & linked data: Elemente zukünftiger Informationsinfrastrukturen ; 1. DGI-Konferenz ; 62. Jahrestagung der DGI ; Frankfurt am Main, 7. - 9. Oktober 2010 ; Proceedings / Deutsche Gesellschaft für Informationswissenschaft und Informationspraxis. Hrsg.: M. Ockenfeld

Chan, L.M.; Lin, X.; Zeng, M.: Structural and multilingual approaches to subject access on the Web (1999) 0.02

0.020225711 = product of:
  0.12135427 = sum of:
    0.045669187 = weight(_text_:web in 162) [ClassicSimilarity], result of:
      0.045669187 = score(doc=162,freq=4.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.4079388 = fieldWeight in 162, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0625 = fieldNorm(doc=162)
    0.075685084 = weight(_text_:suche in 162) [ClassicSimilarity], result of:
      0.075685084 = score(doc=162,freq=2.0), product of:
        0.17138755 = queryWeight, product of:
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.03430388 = queryNorm
        0.441602 = fieldWeight in 162, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.0625 = fieldNorm(doc=162)
  0.16666667 = coord(2/12)

Abstract: Among the great challenges of meaningful searching on the WWW are the vast amount of available material and the language barriers. Methods that structure Web resources by content for more efficient retrieval are therefore needed just as urgently as programs that can cope with linguistic diversity. In this talk we discuss some of the approaches currently being taken to address both problems.

Busch, D.: Organisation eines Thesaurus für die Unterstützung der mehrsprachigen Suche in einer bibliographischen Datenbank im Bereich Planen und Bauen (2016) 0.02
```
0.018900758 = product of:
  0.11340454 = sum of:
    0.094606355 = weight(_text_:suche in 3308) [ClassicSimilarity], result of:
      0.094606355 = score(doc=3308,freq=8.0), product of:
        0.17138755 = queryWeight, product of:
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.03430388 = queryNorm
        0.5520025 = fieldWeight in 3308, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3308)
    0.018798191 = weight(_text_:system in 3308) [ClassicSimilarity], result of:
      0.018798191 = score(doc=3308,freq=2.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.17398985 = fieldWeight in 3308, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3308)
  0.16666667 = coord(2/12)
```
Abstract

The problem of multilingual search has recently been gaining in importance, since much useful specialist information worldwide is published in different languages. RSWBPlus is a bibliographic database for locating specialist literature in the field of planning and building, and it contains German- and English-language metadata records. Until recently it was difficult to find records whose language differed from the query language. For example, German-language queries returned only German-language records, although the database also contained potentially useful English-language records. To solve this problem, RSWBPlus was further developed, following a study of existing approaches, to support multilingual (cross-language) search based on a bilingual concept-based thesaurus. The thesaurus was built automatically from existing thesauri. The entries of the source thesauri were converted to SKOS format (Simple Knowledge Organization System), automatically merged, and finally loaded into a target thesaurus that is likewise maintained in SKOS. Apache Jena and MS SQL Server are used to access the target thesaurus. In multilingual search, the terms of the query are expanded with corresponding translations and synonyms in German and English. The search terms can be expanded either at runtime or semi-automatically. The improved retrieval system can in particular help German-speaking users find relevant English-language records. Using SKOS increases the interoperability of the thesauri and simplifies both building the target thesaurus and accessing its entries.
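
The query expansion step described in this abstract can be illustrated with a minimal sketch. It is not the RSWBPlus implementation: the thesaurus here is a hypothetical in-memory list with invented entries, whereas the abstract states that the real system reaches its SKOS thesaurus through Apache Jena and MS SQL Server. The helper names `expand_term` and `expand_query` are likewise assumptions for illustration.

```python
# Hypothetical stand-in for a bilingual concept-based thesaurus.
# Each concept maps a language code to its labels (preferred label and synonyms).
# All entries are invented for illustration.
THESAURUS = [
    {"de": ["Brücke", "Überführung"], "en": ["bridge", "overpass"]},
    {"de": ["Stadtplanung", "Städtebau"], "en": ["urban planning", "town planning"]},
]


def expand_term(term, thesaurus=THESAURUS):
    """Return the term followed by all other labels of every concept it matches."""
    needle = term.casefold()
    expanded = [term]
    for concept in thesaurus:
        labels = [label for group in concept.values() for label in group]
        if any(label.casefold() == needle for label in labels):
            for label in labels:
                if label.casefold() != needle and label not in expanded:
                    expanded.append(label)
    return expanded


def expand_query(terms):
    """Expand each query term; each inner list can be combined with OR by the caller."""
    return [expand_term(term) for term in terms]


print(expand_query(["Brücke", "Beton"]))
```

A term that matches a concept picks up that concept's German and English labels, while a term unknown to the thesaurus ("Beton" here) is passed through unchanged, so the expansion never narrows the original query.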

Airio, E.: Who benefits from CLIR in web retrieval? (2008) 0.02

0.018604595 = product of:
  0.07441838 = sum of:
    0.04194983 = weight(_text_:web in 2342) [ClassicSimilarity], result of:
      0.04194983 = score(doc=2342,freq=6.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.37471575 = fieldWeight in 2342, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=2342)
    0.009910721 = weight(_text_:information in 2342) [ClassicSimilarity], result of:
      0.009910721 = score(doc=2342,freq=4.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.16457605 = fieldWeight in 2342, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=2342)
    0.02255783 = weight(_text_:system in 2342) [ClassicSimilarity], result of:
      0.02255783 = score(doc=2342,freq=2.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.20878783 = fieldWeight in 2342, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.046875 = fieldNorm(doc=2342)
  0.25 = coord(3/12)

Abstract: Purpose - The aim of the current paper is to test whether query translation is beneficial in web retrieval. Design/methodology/approach - The language pairs were Finnish-Swedish, English-German and Finnish-French. A total of 12-18 participants were recruited for each language pair. Each participant performed four retrieval tasks. The author's aim was to compare the performance of the translated queries with that of the target language queries. Thus, the author asked participants to formulate a source language query and a target language query for each task. The source language queries were translated into the target language utilizing a dictionary-based system. For English-German, machine translation was also utilized. The author used Google as the search engine. Findings - The results differed depending on the language pair. The author concluded that the dictionary coverage had an effect on the results. On average, the results of query translation were better than in the traditional laboratory tests. Originality/value - This research shows that query translation on the web is beneficial especially for users with moderate and non-active language skills. This is valuable information for developers of cross-language information retrieval systems.

Subirats, I.; Prasad, A.R.D.; Keizer, J.; Bagdanov, A.: Implementation of rich metadata formats and semantic tools using DSpace (2008) 0.02
```
0.018267233 = product of:
  0.054801695 = sum of:
    0.016146496 = weight(_text_:web in 2656) [ClassicSimilarity], result of:
      0.016146496 = score(doc=2656,freq=2.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.14422815 = fieldWeight in 2656, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03125 = fieldNorm(doc=2656)
    0.008092071 = weight(_text_:information in 2656) [ClassicSimilarity], result of:
      0.008092071 = score(doc=2656,freq=6.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.1343758 = fieldWeight in 2656, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03125 = fieldNorm(doc=2656)
    0.021267725 = weight(_text_:system in 2656) [ClassicSimilarity], result of:
      0.021267725 = score(doc=2656,freq=4.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.19684705 = fieldWeight in 2656, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03125 = fieldNorm(doc=2656)
    0.009295405 = product of:
      0.01859081 = sum of:
        0.01859081 = weight(_text_:22 in 2656) [ClassicSimilarity], result of:
          0.01859081 = score(doc=2656,freq=2.0), product of:
            0.120126344 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03430388 = queryNorm
            0.15476047 = fieldWeight in 2656, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=2656)
      0.5 = coord(1/2)
  0.33333334 = coord(4/12)
```
Abstract

This poster explores the customization of DSpace to allow the use of the AGRIS Application Profile metadata standard and the AGROVOC thesaurus. The objective is the adaptation of DSpace, through the least invasive code changes either in the form of plug-ins or add-ons, to the specific needs of the Agricultural Sciences and Technology community. Metadata standards such as AGRIS AP, and Knowledge Organization Systems such as the AGROVOC thesaurus, provide mechanisms for sharing information in a standardized manner by recommending the use of common semantics and interoperable syntax (Subirats et al., 2007). AGRIS AP was created to enhance the description, exchange and subsequent retrieval of agricultural Document-like Information Objects (DLIOs). It is a metadata schema which draws from metadata standards such as Dublin Core (DC), the Australian Government Locator Service Metadata (AGLS) and the Agricultural Metadata Element Set (AgMES) namespaces. It allows sharing of information across dispersed bibliographic systems (FAO, 2005). AGROVOC is a multilingual structured thesaurus covering agricultural and related domains. Its main role is to standardize the indexing process in order to make searching simpler and more efficient. AGROVOC is developed by FAO (Lauser et al., 2006). The customization of DSpace is taking place in several phases. First, the AGRIS AP metadata schema was mapped onto the DSpace metadata model, with several enhancements implemented to support AGRIS AP elements. Next, AGROVOC will be integrated as a controlled vocabulary accessed through a local SKOS or OWL file. Eventually the system will be configurable to access AGROVOC through local files or remotely via webservices. Finally, spell checking and tooltips will be incorporated in the user interface to support metadata editing.
Adapting DSpace to support AGRIS AP and annotation using the semantically rich AGROVOC thesaurus transforms DSpace into a powerful, domain-specific system for annotation and exchange of bibliographic metadata in the agricultural domain.
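
Reading a controlled vocabulary from a local SKOS file, as planned in the abstract, might look roughly like the following sketch. It uses only Python's standard library rather than DSpace's own code; the concept URI and labels are invented placeholders, not actual AGROVOC data, and `load_pref_labels` is a hypothetical helper.

```python
import xml.etree.ElementTree as ET

# Tiny SKOS fragment in RDF/XML; URI and labels are invented placeholders.
SKOS_XML = """
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:skos="http://www.w3.org/2004/02/skos/core#">
  <skos:Concept rdf:about="http://example.org/concept/1">
    <skos:prefLabel xml:lang="en">maize</skos:prefLabel>
    <skos:prefLabel xml:lang="fr">maïs</skos:prefLabel>
    <skos:altLabel xml:lang="en">corn</skos:altLabel>
  </skos:Concept>
</rdf:RDF>
"""

NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "skos": "http://www.w3.org/2004/02/skos/core#",
}
# ElementTree exposes namespaced attributes under their expanded names.
RDF_ABOUT = "{http://www.w3.org/1999/02/22-rdf-syntax-ns#}about"
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def load_pref_labels(xml_text):
    """Map each concept URI to a {language: preferred label} dictionary."""
    root = ET.fromstring(xml_text)
    vocabulary = {}
    for concept in root.findall("skos:Concept", NS):
        labels = {}
        for label in concept.findall("skos:prefLabel", NS):
            labels[label.get(XML_LANG)] = label.text
        vocabulary[concept.get(RDF_ABOUT)] = labels
    return vocabulary


vocabulary = load_pref_labels(SKOS_XML)
print(vocabulary)
```

Keying the result by concept URI keeps the labels of all languages attached to one identifier, which is what lets a multilingual thesaurus standardize indexing across languages.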

Source

Metadata for semantic and social applications : proceedings of the International Conference on Dublin Core and Metadata Applications, Berlin, 22 - 26 September 2008, DC 2008: Berlin, Germany / ed. by Jane Greenberg and Wolfgang Klas

Theme

Semantic Web

Garcia Jiménez, A.; Díaz Esteban, A.; Gervás, P.: Knowledge organization in a multilingual system for the personalization of digital news services : how to integrate knowledge (2003) 0.02

0.0180211 = product of:
  0.0720844 = sum of:
    0.02018312 = weight(_text_:web in 2748) [ClassicSimilarity], result of:
      0.02018312 = score(doc=2748,freq=2.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.18028519 = fieldWeight in 2748, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2748)
    0.014304894 = weight(_text_:information in 2748) [ClassicSimilarity], result of:
      0.014304894 = score(doc=2748,freq=12.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.23754507 = fieldWeight in 2748, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2748)
    0.037596382 = weight(_text_:system in 2748) [ClassicSimilarity], result of:
      0.037596382 = score(doc=2748,freq=8.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.3479797 = fieldWeight in 2748, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2748)
  0.25 = coord(3/12)

Abstract: In this paper we are concerned with the type of services that send periodic news selections to subscribers of a digital newspaper by means of electronic mail. The aims are to study the influence of categorisation in information retrieval and in digital newspapers, different models to solve problems of bilingualism in digital information services and to analyse the evaluation in information filtering and personalisation in information agents. Hermes is a multilingual system for the personalisation of news services which allows integration and categorisation of information in two languages. In order to customise information for each user, Hermes provides the means for representing a user's interests homogeneously across the operating languages of the system. A simple system is applied to train automatically a dynamic news item classifier for both languages, by taking the Yahoo set of categories as reference framework and using the web pages classified under them as training collection. Traditional evaluation methods have been applied and their shortcomings for the present endeavour have been noted.

Soergel, D.: SemWeb: Proposal for an Open, multifunctional, multilingual system for integrated access to knowledge about concepts and terminology : exploration and development of the concept (1996) 0.02
```
0.017619023 = product of:
  0.07047609 = sum of:
    0.02018312 = weight(_text_:web in 3576) [ClassicSimilarity], result of:
      0.02018312 = score(doc=3576,freq=2.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.18028519 = fieldWeight in 3576, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3576)
    0.008258934 = weight(_text_:information in 3576) [ClassicSimilarity], result of:
      0.008258934 = score(doc=3576,freq=4.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.13714671 = fieldWeight in 3576, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3576)
    0.042034037 = weight(_text_:system in 3576) [ClassicSimilarity], result of:
      0.042034037 = score(doc=3576,freq=10.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.38905317 = fieldWeight in 3576, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3576)
  0.25 = coord(3/12)
```
Abstract

This paper presents a proposal for the long-range development of an open, multifunctional, multilingual system for integrated access to many kinds of knowledge about concepts and terminology. The system would draw on existing knowledge bases that are accessible through the Internet or on CD-ROM and on a common integrated distributed knowledge base that would grow incrementally over time. Existing knowledge bases would be accessed through a common interface that would search several knowledge bases, collate the data into a common format, and present them to the user. The common integrated distributed knowledge base would provide an environment in which many contributors could carry out classification and terminological projects more efficiently, with the results available in a common format. Over time, data from other knowledge bases could be incorporated into the common knowledge base, either by actual transfer (provided the knowledge base producers are willing) or by reference through a link. Either way, such incorporation requires intellectual work but allows for tighter integration than common interface access to multiple knowledge bases. Each piece of information in the common knowledge base will have all its sources attached, providing an acknowledgment mechanism that gives due credit to all contributors. The whole system would be designed to be usable by many levels of users for improved information exchange.

Content

Expanded version of a paper published in Advances in Knowledge Organization v.5 (1996): 165-173 (4th Annual ISKO Conference, Washington, D.C., 1996 July 15-18): SemWeb: proposal for an open, multifunctional, multilingual system for integrated access to knowledge about concepts and terminology.

Theme

Semantic Web

Soergel, D.: SemWeb: proposal for an open, multifunctional, multilingual system for integrated access to knowledge about concepts and terminology (1996) 0.02
```
0.01650961 = product of:
  0.06603844 = sum of:
    0.02018312 = weight(_text_:web in 3575) [ClassicSimilarity], result of:
      0.02018312 = score(doc=3575,freq=2.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.18028519 = fieldWeight in 3575, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3575)
    0.008258934 = weight(_text_:information in 3575) [ClassicSimilarity], result of:
      0.008258934 = score(doc=3575,freq=4.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.13714671 = fieldWeight in 3575, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3575)
    0.037596382 = weight(_text_:system in 3575) [ClassicSimilarity], result of:
      0.037596382 = score(doc=3575,freq=8.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.3479797 = fieldWeight in 3575, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3575)
  0.25 = coord(3/12)
```
Abstract

Presents a proposal for the long-range development of an open, multifunctional, multilingual system for integrated access to many kinds of knowledge about concepts and terminology. The system would draw on existing knowledge bases that are accessible through the Internet or on CD-ROM and on a common integrated distributed knowledge base that would grow incrementally over time. Existing knowledge bases would be accessed through a common interface that would search several knowledge bases, collate the data into a common format, and present them to the user. The common integrated distributed knowledge base would provide an environment in which many contributors could carry out classification and terminological projects more efficiently, with the results available in a common format. Over time, data from other knowledge bases could be incorporated into the common knowledge base, either by actual transfer (provided the knowledge base producers are willing) or by reference through a link. Either way, such incorporation requires intellectual work but allows for tighter integration than common interface access to multiple knowledge bases. Each piece of information in the common knowledge base will have all its sources attached, providing an acknowledgment mechanism that gives due credit to all contributors. The whole system would be designed to be usable by many levels of users for improved information exchange.

Theme

Semantic Web

Reinisch, F.: Wer suchet - der findet? : oder Die Überwindung der sprachlichen Grenzen bei der Suche in Volltextdatenbanken (2000) 0.02

0.015712649 = product of:
  0.09427589 = sum of:
    0.075685084 = weight(_text_:suche in 4919) [ClassicSimilarity], result of:
      0.075685084 = score(doc=4919,freq=2.0), product of:
        0.17138755 = queryWeight, product of:
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.03430388 = queryNorm
        0.441602 = fieldWeight in 4919, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.0625 = fieldNorm(doc=4919)
    0.01859081 = product of:
      0.03718162 = sum of:
        0.03718162 = weight(_text_:22 in 4919) [ClassicSimilarity], result of:
          0.03718162 = score(doc=4919,freq=2.0), product of:
            0.120126344 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03430388 = queryNorm
            0.30952093 = fieldWeight in 4919, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=4919)
      0.5 = coord(1/2)
  0.16666667 = coord(2/12)

Date: 22. 7.2000 17:48:06

Nichols, D.M.; Witten, I.H.; Keegan, T.T.; Bainbridge, D.; Dewsnip, M.: Digital libraries and minority languages (2005) 0.02

0.015687441 = product of:
  0.062749766 = sum of:
    0.02825637 = weight(_text_:web in 5914) [ClassicSimilarity], result of:
      0.02825637 = score(doc=5914,freq=2.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.25239927 = fieldWeight in 5914, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0546875 = fieldNorm(doc=5914)
    0.008175928 = weight(_text_:information in 5914) [ClassicSimilarity], result of:
      0.008175928 = score(doc=5914,freq=2.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.13576832 = fieldWeight in 5914, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=5914)
    0.026317468 = weight(_text_:system in 5914) [ClassicSimilarity], result of:
      0.026317468 = score(doc=5914,freq=2.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.2435858 = fieldWeight in 5914, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.0546875 = fieldNorm(doc=5914)
  0.25 = coord(3/12)

Abstract: Digital libraries have a pivotal role to play in the preservation and maintenance of international cultures in general and minority languages in particular. This paper outlines a software tool for building digital libraries that is well adapted for creating and distributing local information collections in minority languages, and describes some contexts in which it is used. The system can make multilingual documents available in structured collections and allows them to be accessed via multilingual interfaces. It is issued under a free open-source licence, which encourages participatory design of the software, and an end-user interface allows community-based localization of the various language interfaces, of which there are many.
Content: Contribution to a special issue "Minority languages, multimedia and the Web"

Lassalle, E.: Text retrieval : from a monolingual system to a multilingual system (1993) 0.01

0.013768828 = product of:
  0.08261296 = sum of:
    0.008175928 = weight(_text_:information in 7403) [ClassicSimilarity], result of:
      0.008175928 = score(doc=7403,freq=2.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.13576832 = fieldWeight in 7403, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=7403)
    0.07443704 = weight(_text_:system in 7403) [ClassicSimilarity], result of:
      0.07443704 = score(doc=7403,freq=16.0), product of:
        0.10804188 = queryWeight, product of:
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.03430388 = queryNorm
        0.68896466 = fieldWeight in 7403, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          3.1495528 = idf(docFreq=5152, maxDocs=44218)
          0.0546875 = fieldNorm(doc=7403)
  0.16666667 = coord(2/12)

Abstract: Describes the TELMI monolingual text retrieval system and its future extension, a multilingual system. TELMI is designed for medium-sized databases containing short texts. The characteristics of the system are fine-grained natural language processing (NLP); an open domain and a large-scale knowledge base; automated indexing based on conceptual representation of texts; and reusability of the NLP tools. Discusses the French MINITEL service, the MGS information service and the TELMI research system, covering the full text system; NLP architecture; the lexical level; the syntactic level; the semantic level; and an example of the use of a generic system.

Rettinger, A.; Schumilin, A.; Thoma, S.; Ell, B.: Learning a cross-lingual semantic representation of relations expressed in text (2015) 0.01

0.01359938 = product of:
  0.08159628 = sum of:
    0.06991638 = weight(_text_:web in 2027) [ClassicSimilarity], result of:
      0.06991638 = score(doc=2027,freq=6.0), product of:
        0.111951075 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03430388 = queryNorm
        0.6245262 = fieldWeight in 2027, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.078125 = fieldNorm(doc=2027)
    0.011679897 = weight(_text_:information in 2027) [ClassicSimilarity], result of:
      0.011679897 = score(doc=2027,freq=2.0), product of:
        0.060219705 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03430388 = queryNorm
        0.19395474 = fieldWeight in 2027, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.078125 = fieldNorm(doc=2027)
  0.16666667 = coord(2/12)

Series: Information Systems and Applications, incl. Internet/Web, and HCI; Vol. 9088
Source: The Semantic Web: latest advances and new domains. 12th European Semantic Web Conference, ESWC 2015 Portoroz, Slovenia, May 31 - June 4, 2015. Proceedings. Eds.: F. Gandon et al.
