Search (15 results, page 1 of 1)

Scott, D.S.: Subject classification and natural-language processing for retrieval in large databases (1989) 0.07

0.06842273 = product of:
  0.10263409 = sum of:
    0.03603666 = weight(_text_:im in 967) [ClassicSimilarity], result of:
      0.03603666 = score(doc=967,freq=2.0), product of:
        0.1442303 = queryWeight, product of:
          2.8267863 = idf(docFreq=7115, maxDocs=44218)
          0.051022716 = queryNorm
        0.24985497 = fieldWeight in 967, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.8267863 = idf(docFreq=7115, maxDocs=44218)
          0.0625 = fieldNorm(doc=967)
    0.066597424 = product of:
      0.09989613 = sum of:
        0.0415382 = weight(_text_:online in 967) [ClassicSimilarity], result of:
          0.0415382 = score(doc=967,freq=2.0), product of:
            0.1548489 = queryWeight, product of:
              3.0349014 = idf(docFreq=5778, maxDocs=44218)
              0.051022716 = queryNorm
            0.2682499 = fieldWeight in 967, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0349014 = idf(docFreq=5778, maxDocs=44218)
              0.0625 = fieldNorm(doc=967)
        0.058357935 = weight(_text_:retrieval in 967) [ClassicSimilarity], result of:
          0.058357935 = score(doc=967,freq=4.0), product of:
            0.15433937 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.051022716 = queryNorm
            0.37811437 = fieldWeight in 967, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0625 = fieldNorm(doc=967)
      0.6666667 = coord(2/3)
  0.6666667 = coord(2/3)

Theme: Klassifikationssysteme im Online-Retrieval

Tudhope, D.; Binding, C.; Blocks, D.; Cuncliffe, D.: Representation and retrieval in faceted systems (2003) 0.06

0.060477715 = product of:
  0.09071657 = sum of:
    0.031852208 = weight(_text_:im in 2703) [ClassicSimilarity], result of:
      0.031852208 = score(doc=2703,freq=4.0), product of:
        0.1442303 = queryWeight, product of:
          2.8267863 = idf(docFreq=7115, maxDocs=44218)
          0.051022716 = queryNorm
        0.22084267 = fieldWeight in 2703, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.8267863 = idf(docFreq=7115, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2703)
    0.058864366 = product of:
      0.08829655 = sum of:
        0.036714934 = weight(_text_:online in 2703) [ClassicSimilarity], result of:
          0.036714934 = score(doc=2703,freq=4.0), product of:
            0.1548489 = queryWeight, product of:
              3.0349014 = idf(docFreq=5778, maxDocs=44218)
              0.051022716 = queryNorm
            0.23710167 = fieldWeight in 2703, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.0349014 = idf(docFreq=5778, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2703)
        0.051581617 = weight(_text_:retrieval in 2703) [ClassicSimilarity], result of:
          0.051581617 = score(doc=2703,freq=8.0), product of:
            0.15433937 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.051022716 = queryNorm
            0.33420905 = fieldWeight in 2703, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2703)
      0.6666667 = coord(2/3)
  0.6666667 = coord(2/3)

Abstract: This paper discusses two inter-related themes: the retrieval potential of faceted thesauri and XML representations of fundamental facets. Initial findings are discussed from the ongoing 'FACET' project, in collaboration with the National Museum of Science and Industry. The work discussed seeks to take advantage of the structure afforded by faceted systems for multi-term queries and flexible matching, focusing in this paper an the Art and Architecture Thesaurus. A multi-term matching function yields ranked results with partial matches via semantic term expansion, based an a measure of distance over the semantic index space formed by thesaurus relationships. Our intention is to drive the system from general representations and a common query structure and interface. To this end, we are developing an XML representation based an work by the Classification Research Group an fundamental facets or categories. The XML representation maps categories to particular thesauri and hierarchies. The system interface, which is configured by the mapping, incorporates a thesaurus browser with navigation history together with a term search facility and drag and drop query builder.
Theme: Klassifikationssysteme im Online-Retrieval
Verbale Doksprachen im Online-Retrieval

Bee, G.: CrissCross (2006) 0.03
```
0.02754106 = product of:
  0.04131159 = sum of:
    0.035253935 = weight(_text_:im in 1275) [ClassicSimilarity], result of:
      0.035253935 = score(doc=1275,freq=10.0), product of:
        0.1442303 = queryWeight, product of:
          2.8267863 = idf(docFreq=7115, maxDocs=44218)
          0.051022716 = queryNorm
        0.24442805 = fieldWeight in 1275, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          2.8267863 = idf(docFreq=7115, maxDocs=44218)
          0.02734375 = fieldNorm(doc=1275)
    0.006057655 = product of:
      0.018172964 = sum of:
        0.018172964 = weight(_text_:online in 1275) [ClassicSimilarity], result of:
          0.018172964 = score(doc=1275,freq=2.0), product of:
            0.1548489 = queryWeight, product of:
              3.0349014 = idf(docFreq=5778, maxDocs=44218)
              0.051022716 = queryNorm
            0.11735933 = fieldWeight in 1275, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0349014 = idf(docFreq=5778, maxDocs=44218)
              0.02734375 = fieldNorm(doc=1275)
      0.33333334 = coord(1/3)
  0.6666667 = coord(2/3)
```
Content

"»Simplify your life«, heißt einer der großen Sachbuchbestseller der letzten Jahre. Dessen Quintessenz: In einer zusehends komplexer werdenden Welt sind Vereinfachungen (über)lebensnotwendig. Dies gilt für alle Lebensbereiche, in besonderem Maße aber für die Suche nach Informationen. Um Bibliotheksbenutzer an die von ihnen gewünschte Literatur heranzuführen, kann seitens der Bibliotheken sicher noch einiges an Vereinfachungsleistung erbracht werden - insbesondere dann, wenn der Benutzer sich bei der Suche nicht mit den Beständen seines nationalen Bibliothekssystems begnügt, sondern die Möglichkeit nutzt, weltweit online zu recherchieren. Eine Suche unter Zuhilfenahme von Titelstichwörtern bereitet heute nur wenig Kopfzerbrechen, ist aber oft unbefriedigend für die sachliche Suche. Leider gestaltet sich der hier deutlich zielführende Rückgriff auf Sacherschließungsdaten in der Praxis erheblich schwieriger. Der Benutzer ist mit einer Vielzahl unterschiedlicher Erschließungssysteme konfrontiert, die vielfach unverbunden nebeneinander existieren. Da die Bestände der weitaus meisten Bibliotheken heterogen erschlossen sind, müsste der Benutzer alle Erschließungsverfahren kennen und nacheinander anwenden, um sicherzustellen, dass ihm keine Informationen verloren gehen.
An diesem Punkt setzt das neue Projekt CrissCross an, das gemeinsam von der Deutschen Nationalbibliothek (DNB) und der Fachhochschule Köln betrieben und von der Deutschen Forschungsgemeinschaft (DFG) gefördert wird. CrissCross schafft Verbindungslinien zwischen einigen besonders verbreiteten und bewährten Erschließungsinstrumenten: Der deutschsprachigen Schlagwortnormdatei (SWD), den besonders im angloamerikanischen Raum verbreiteten Library of Congress Subject Headings (LCSH), der französischen Dokumentationssprache Répertoire d'autorité-matière encyclopédique et alphabétique unifié (Rameau) und der Dewey Decimal Classification (DDC) als dem international am meisten verbreiteten Klassifikationssystem. CrissCross kann dabei auf zwei wichtigen Vorgängerprojekten aufbauen. Hier ist zum einen das ebenfalls von DNB und der Fachhochschule Köln gemeinsam betriebene Vorgängerprojekt »DDC Deutsch« zu nennen. Bereits während dessen Laufzeit war deutlich geworden, wie vorteilhaft sich eine Anreicherung des DDC-Registers mit dem Schlagwortbestand der SWD auf die Klassifikationspraxis auswirken könnte. Genau dieser Schritt wird nun vollzogen, wobei sich die Projektarbeit auf die Sachschlagwörter konzentriert - kein kleines Unterfangen, beläuft sich doch die Zahl der zurzeit in der SWD befindlichen Datensätze mit dem Indikator s auf über 154.000.
Ein weiteres Fundament sind die Vorarbeiten aus dem gemeinsam von DNB, der Schweizerischen Landesbibliothek und anderen Partnern betriebenen Projekt Multilingual Access to Subjects (MACS). Im Rahmen von MACS wurden in größerem Umfang LCSH und Rameau-Datensätze miteinander verknüpft; einigen der dabei entstandenen Schlagwortpärchen wurde dabei bereits ein SWD-Äquivalent zugeordnet. Dreierverbindungen dieses Typs, erweitert um DDC-Notationen, werden auch im Rahmen von CrissCross entstehen. Die unterschiedliche Strukturierung der einzelnen Schlagwortsprachen sorgt allerdings für eine Vielzahl von Problemen, die das Projektteam bewältigen muss. Durch die Verlinkung der Schlagwortsprachen und die Verknüpfung mit der DDC erstellt CrissCross ein multilinguales verbales Recherchevokabular. Ungewöhnlich und innovativ ist dabei vor allem die Verbindung von verbaler und klassifikatorischer Sacherschließung, die zu einer bedeutenden Erweiterung der Recherchemöglichkeiten führen wird. Dies wird vor allem dann der Fall sein, wenn in Onlineumgebungen nutzbare Hilfsmittel wie das von der DNB zurzeit entwickelte Normdatenrecherchetool zur Verfügung stehen. In naher Zukunft wird der Nutzer dann bei seiner Suche auf eine Vielzahl unterschiedlich erschlossener Werke stoßen, ohne dass ihm die im Hintergrund vollzogene Auswertung von Normdaten bewusst sein muss. Er braucht im Grunde genommen überhaupt kein Vorwissen über Sacherschließungsinstrumente mitzubringen. »Simplify your search«, könnte das Motto dieser Vorgehensweise lauten. Die Nutzerfreundlichkeit der Bibliotheken wird durch den Einsatz derartiger Instrumente eine deutliche Steigerung erfahren. CrissCross wird seinen Teil dazu beitragen."
Balikova, M.: Multilingual Subject Access to Catalogues of National Libraries (MSAC) : Czech Republic's collaboration with Slovakia, Slovenia, Croatia, Macedonia, Lithuania and Latvia (2005) 0.02
```
0.018246476 = product of:
  0.054739427 = sum of:
    0.054739427 = product of:
      0.08210914 = sum of:
        0.035973143 = weight(_text_:online in 4349) [ClassicSimilarity], result of:
          0.035973143 = score(doc=4349,freq=6.0), product of:
            0.1548489 = queryWeight, product of:
              3.0349014 = idf(docFreq=5778, maxDocs=44218)
              0.051022716 = queryNorm
            0.23231125 = fieldWeight in 4349, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.0349014 = idf(docFreq=5778, maxDocs=44218)
              0.03125 = fieldNorm(doc=4349)
        0.046136 = weight(_text_:retrieval in 4349) [ClassicSimilarity], result of:
          0.046136 = score(doc=4349,freq=10.0), product of:
            0.15433937 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.051022716 = queryNorm
            0.29892567 = fieldWeight in 4349, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.03125 = fieldNorm(doc=4349)
      0.6666667 = coord(2/3)
  0.33333334 = coord(1/3)
```
Abstract

Czech authority file of topical terms is intended to form a base for multilingual controlled vocabulary. The aim of the proposal is to provide users of online library catalogues and internet services of cooperating institutions with an indexing and retrieval tool which enables multilingual and cross-domain searching ("one-stop" seamless searching). The goal of the project is to establish a multilingual subject approach to catalogues of participating libraries (Czechia, Croatia, Latvia, Lithuania, Macedonia, Slovakia, and Slovenia). In practice this means that a user in any of these countries would enter a query in his local language and receive hits from all the catalogues. The initiative is complying with the main goals currently defined by IFLA for the activity of Indexing and Classification Section, it means: Changing Roles of Subject Access Tools (Berlin), Implementation and Adaptation of Global Tools for Subject Access to Local Needs (Buenos Aires), and Cataloguing and Subject Tools for Global Access: International Partnerships (Oslo).

Content

The aim of this initiative is to provide the users of online library catalogues and information gateways of cooperating libraries with a prototype for multilingual subject searching in online environment. Library collections of these libraries are large and without any doubt very valuable for researchers throughout Europe. What is needed is a standardized, authorized indexing and retrieval tool which would bring together all their catalogues and databases and enable multilingual subject searching. At the beginning of the project, a number of factors affecting subject indexing in current environment and cross-searching for subjects have been identified. These factors include - standardization of subject retrieval process and indexing and classification tools - subject retrieval methods - possibility of interoperability among different indexing and classification schemes - multilingualism issue - possibility to increase precision and recall trough Z39.50 protocol and its profiles and to apply authority control in subject retrieval process - need for cooperation

Panzer, M.: DDC in Germany : Recent Developments and Current Activities (2007) 0.02

0.01801833 = product of:
  0.054054987 = sum of:
    0.054054987 = weight(_text_:im in 98) [ClassicSimilarity], result of:
      0.054054987 = score(doc=98,freq=2.0), product of:
        0.1442303 = queryWeight, product of:
          2.8267863 = idf(docFreq=7115, maxDocs=44218)
          0.051022716 = queryNorm
        0.37478244 = fieldWeight in 98, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.8267863 = idf(docFreq=7115, maxDocs=44218)
          0.09375 = fieldNorm(doc=98)
  0.33333334 = coord(1/3)

Content: Vortrag auf dem ALA Midwinter Meeting im Januar 2007 in Seattle.

Root thesaurus. Pt.1.2 (1985) 0.01

0.009217165 = product of:
  0.027651494 = sum of:
    0.027651494 = product of:
      0.08295448 = sum of:
        0.08295448 = weight(_text_:22 in 467) [ClassicSimilarity], result of:
          0.08295448 = score(doc=467,freq=2.0), product of:
            0.17867287 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.051022716 = queryNorm
            0.46428138 = fieldWeight in 467, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=467)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)

Date: 18. 5.2007 14:22:43

Karg, H.: Mapping Dewey and subject authorities : CrissCross (2007) 0.01

0.009217165 = product of:
  0.027651494 = sum of:
    0.027651494 = product of:
      0.08295448 = sum of:
        0.08295448 = weight(_text_:22 in 559) [ClassicSimilarity], result of:
          0.08295448 = score(doc=559,freq=2.0), product of:
            0.17867287 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.051022716 = queryNorm
            0.46428138 = fieldWeight in 559, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=559)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)

Content: Vortrag anläasslich des Workshops: "Extending the multilingual capacity of The European Library in the EDL project Stockholm, Swedish National Library, 22-23 November 2007".

Aitchison, J.: ¬The thesaurofacet. A multipurpose retrieval language tool (1970) 0.01

0.009170066 = product of:
  0.027510196 = sum of:
    0.027510196 = product of:
      0.08253059 = sum of:
        0.08253059 = weight(_text_:retrieval in 460) [ClassicSimilarity], result of:
          0.08253059 = score(doc=460,freq=2.0), product of:
            0.15433937 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.051022716 = queryNorm
            0.5347345 = fieldWeight in 460, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.125 = fieldNorm(doc=460)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)

Wilson, T.D.: ¬The work of the British Classification Research Group (1972) 0.01

0.006877549 = product of:
  0.020632647 = sum of:
    0.020632647 = product of:
      0.06189794 = sum of:
        0.06189794 = weight(_text_:retrieval in 2766) [ClassicSimilarity], result of:
          0.06189794 = score(doc=2766,freq=2.0), product of:
            0.15433937 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.051022716 = queryNorm
            0.40105087 = fieldWeight in 2766, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.09375 = fieldNorm(doc=2766)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)

Source: Subject retrieval in the seventies: new directions. Proc. of an int. symp. ... College Park, 14.-15.5.1971. Ed.: H.H. Wellisch u.a

Francu, V.: ¬The impact of specificity on the retrieval power of a UDC-based multilingual thesaurus (2003) 0.01
```
0.006877549 = product of:
  0.020632647 = sum of:
    0.020632647 = product of:
      0.06189794 = sum of:
        0.06189794 = weight(_text_:retrieval in 5518) [ClassicSimilarity], result of:
          0.06189794 = score(doc=5518,freq=8.0), product of:
            0.15433937 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.051022716 = queryNorm
            0.40105087 = fieldWeight in 5518, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.046875 = fieldNorm(doc=5518)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)
```
Abstract

The article describes the research done over a bibliographic database in order to show the impact the specificity of the knowledge organising tools may have on information retrieval (IR). For this purpose two multilingual Universal Decimal Classification (UDC) based thesauri having different degrees of specificity are considered. Issues of harmonising a classificatory structure with a thesaurus structure are introduced, and significant aspects of information retrieval in a multilingual environment are examined in an extensive manner. Aspects of complementarity are discussed with particular emphasis on the real impact produced on IR by alternative search facilities. Finally, a number of conclusions are formulated as they arise from the study.

Content

Beitrag eines Themenheftes "Knowledge organization and classification in international information retrieval"

Cochrane, P.A.: Subject access - free text and controlled : the case of Papua New Guinea (1985) 0.01

0.0065270998 = product of:
  0.0195813 = sum of:
    0.0195813 = product of:
      0.058743894 = sum of:
        0.058743894 = weight(_text_:online in 1459) [ClassicSimilarity], result of:
          0.058743894 = score(doc=1459,freq=4.0), product of:
            0.1548489 = queryWeight, product of:
              3.0349014 = idf(docFreq=5778, maxDocs=44218)
              0.051022716 = queryNorm
            0.37936267 = fieldWeight in 1459, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.0349014 = idf(docFreq=5778, maxDocs=44218)
              0.0625 = fieldNorm(doc=1459)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)

Abstract: The online catalogue can provide the user with efficient and effective access through a variety of access points. New interests in subject heading is indicated. Keyword access and free text searching are considered alternatice methods. An investigation is suggested into the symbiotic relationship between classification and subject heading
Source: Online public access to library files. Conf. Proc. held at the Univ. of Bath, 3.-5.9.1984. Ed.: J. Kinsella

Raghavan, K.S.: ¬The general theory of classification as the basis for structuring of subject headings (1985(?)) 0.00

0.004585033 = product of:
  0.013755098 = sum of:
    0.013755098 = product of:
      0.041265294 = sum of:
        0.041265294 = weight(_text_:retrieval in 1830) [ClassicSimilarity], result of:
          0.041265294 = score(doc=1830,freq=2.0), product of:
            0.15433937 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.051022716 = queryNorm
            0.26736724 = fieldWeight in 1830, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0625 = fieldNorm(doc=1830)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)

Abstract: Defines the basic functions of surrogate files in information retrieval. Exemplifies the categories enunciated in the general theory of classification developed in India. Defines the principles for structuring the concepts. Formulates set of general postulates pertaining to the structure of compound subjects. On the basis of these, outlines a procedure for structuring of subject headings. Demonstrates the application of procedure through examples

Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: Compound descriptors in context : a matching function for classifications and thesauri (2002) 0.00
```
0.004052635 = product of:
  0.012157904 = sum of:
    0.012157904 = product of:
      0.03647371 = sum of:
        0.03647371 = weight(_text_:retrieval in 3179) [ClassicSimilarity], result of:
          0.03647371 = score(doc=3179,freq=4.0), product of:
            0.15433937 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.051022716 = queryNorm
            0.23632148 = fieldWeight in 3179, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3179)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)
```
Abstract

There are many advantages for Digital Libraries in indexing with classifications or thesauri, but some current disincentive in the lack of flexible retrieval tools that deal with compound descriptors. This paper discusses a matching function for compound descriptors, or multi-concept subject headings, that does not rely an exact matching but incorporates term expansion via thesaurus semantic relationships to produce ranked results that take account of missing and partially matching terms. The matching function is based an a measure of semantic closeness between terms, which has the potential to help with recall problems. The work reported is part of the ongoing FACET project in collaboration with the National Museum of Science and Industry and its collections database. The architecture of the prototype system and its Interface are outlined. The matching problem for compound descriptors is reviewed and the FACET implementation described. Results are discussed from scenarios using the faceted Getty Art and Architecture Thesaurus. We argue that automatic traversal of thesaurus relationships can augment the user's browsing possibilities. The techniques can be applied both to unstructured multi-concept subject headings and potentially to more syntactically structured strings. The notion of a focus term is used by the matching function to model AAT modified descriptors (noun phrases). The relevance of the approach to precoordinated indexing and matching faceted strings is discussed.

Theme

Semantisches Umfeld in Indexierung u. Retrieval
Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.00
```
0.003970755 = product of:
  0.011912264 = sum of:
    0.011912264 = product of:
      0.03573679 = sum of:
        0.03573679 = weight(_text_:retrieval in 175) [ClassicSimilarity], result of:
          0.03573679 = score(doc=175,freq=6.0), product of:
            0.15433937 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.051022716 = queryNorm
            0.23154683 = fieldWeight in 175, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.03125 = fieldNorm(doc=175)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)
```
Abstract

There are many advantages for Digital Libraries in indexing with classifications or thesauri, but some current disincentive in the lack of flexible retrieval tools that deal with compound descriptors. This demonstration of a research prototype illustrates a matching function for compound descriptors, or multi-concept subject headings, that does not rely on exact matching but incorporates term expansion via thesaurus semantic relationships to produce ranked results that take account of missing and partially matching terms. The matching function is based on a measure of semantic closeness between terms.The work is part of the EPSRC funded FACET project in collaboration with the UK National Museum of Science and Industry (NMSI) which includes the National Railway Museum. An export of NMSI's Collections Database is used as the dataset for the research. The J. Paul Getty Trust's Art and Architecture Thesaurus (AAT) is the main thesaurus in the project. The AAT is a widely used thesaurus (over 120,000 terms). Descriptors are organised in 7 facets representing separate conceptual classes of terms.The FACET application is a multi tiered architecture accessing a SQL Server database, with an OLE DB connection. The thesauri are stored as relational tables in the Server's database. However, a key component of the system is a parallel representation of the underlying semantic network as an in-memory structure of thesaurus concepts (corresponding to preferred terms). The structure models the hierarchical and associative interrelationships of thesaurus concepts via weighted poly-hierarchical links. Its primary purpose is real-time semantic expansion of query terms, achieved by a spreading activation semantic closeness algorithm. Queries with associated results are stored persistently using XML format data. A Visual Basic interface combines a thesaurus browser and an initial term search facility that takes into account equivalence relationships. Terms are dragged to a direct manipulation Query Builder which maintains the facet structure.

Theme

Semantisches Umfeld in Indexierung u. Retrieval
Frâncu, V.: ¬A universal classification system going through changes (2001) 0.00
```
0.0028845975 = product of:
  0.008653793 = sum of:
    0.008653793 = product of:
      0.025961377 = sum of:
        0.025961377 = weight(_text_:online in 1593) [ClassicSimilarity], result of:
          0.025961377 = score(doc=1593,freq=2.0), product of:
            0.1548489 = queryWeight, product of:
              3.0349014 = idf(docFreq=5778, maxDocs=44218)
              0.051022716 = queryNorm
            0.16765618 = fieldWeight in 1593, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0349014 = idf(docFreq=5778, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1593)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)
```
Abstract

In the early 1990s, indexing with classification codes from the Universal Decimal Classification (UDC) in an academic library, going from traditional to automated routines in any and all its activities, suddenly proved insufficient. Under the circumstances of online searching, the possibilities offered by the new OPAC looked much more attractive to indexers and searchers alike. Therefore, a quick shift to indexing with UDC and keywords instead of UDC numbers alone was made. Currency, precision and, more importantly, user-friendliness were strong advantages offered by keyword indexing and searching. But the larger the dictionary of keywords, the more problematic the consequences an information scattering, given the lack of control an terms. The present paper describes the advantages of the UDC in indexing by presenting some of the devices it is provided with: subdivision by analogy, common and special auxiliaries, use of synthesis, and use of connecting symbols. The solution of indexing with both UDC notation and words from a thesaurus based an UDC was prompted by some other characteristics of the schedules: a semi-faceted classification system, hierarchical organisation, richness in terminology and consistency and control of notation. The methodology used in building the thesaurus is conceived according to the international standards (ISO 2788 and 5964) to which some principles have been added, giving the specific approach of harmonising a classification structure with that of a thesaurus. Compatibility and translatability issues are also considered and some problems arising from them are treated in detail. Most of the problems discussed are illustrated with examples.

Search (15 results, page 1 of 1)

Authors

Years

Types

Themes

Subjects