Search (39 results, page 1 of 2)

Gabler, S.: Vergabe von DDC-Sachgruppen mittels eines Schlagwort-Thesaurus (2021) 0.09
```
0.086690515 = sum of:
  0.069130175 = product of:
    0.20739052 = sum of:
      0.20739052 = weight(_text_:3a in 1000) [ClassicSimilarity], result of:
        0.20739052 = score(doc=1000,freq=2.0), product of:
          0.4428125 = queryWeight, product of:
            8.478011 = idf(docFreq=24, maxDocs=44218)
            0.052230705 = queryNorm
          0.46834838 = fieldWeight in 1000, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            8.478011 = idf(docFreq=24, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1000)
    0.33333334 = coord(1/3)
  0.007550162 = weight(_text_:in in 1000) [ClassicSimilarity], result of:
    0.007550162 = score(doc=1000,freq=4.0), product of:
      0.07104705 = queryWeight, product of:
        1.3602545 = idf(docFreq=30841, maxDocs=44218)
        0.052230705 = queryNorm
      0.10626988 = fieldWeight in 1000, product of:
        2.0 = tf(freq=4.0), with freq of:
          4.0 = termFreq=4.0
        1.3602545 = idf(docFreq=30841, maxDocs=44218)
        0.0390625 = fieldNorm(doc=1000)
  0.010010177 = product of:
    0.020020355 = sum of:
      0.020020355 = weight(_text_:science in 1000) [ClassicSimilarity], result of:
        0.020020355 = score(doc=1000,freq=2.0), product of:
          0.1375819 = queryWeight, product of:
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.052230705 = queryNorm
          0.1455159 = fieldWeight in 1000, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1000)
    0.5 = coord(1/2)
```
Abstract

Vorgestellt wird die Konstruktion eines thematisch geordneten Thesaurus auf Basis der Sachschlagwörter der Gemeinsamen Normdatei (GND) unter Nutzung der darin enthaltenen DDC-Notationen. Oberste Ordnungsebene dieses Thesaurus werden die DDC-Sachgruppen der Deutschen Nationalbibliothek. Die Konstruktion des Thesaurus erfolgt regelbasiert unter der Nutzung von Linked Data Prinzipien in einem SPARQL Prozessor. Der Thesaurus dient der automatisierten Gewinnung von Metadaten aus wissenschaftlichen Publikationen mittels eines computerlinguistischen Extraktors. Hierzu werden digitale Volltexte verarbeitet. Dieser ermittelt die gefundenen Schlagwörter über Vergleich der Zeichenfolgen Benennungen im Thesaurus, ordnet die Treffer nach Relevanz im Text und gibt die zugeordne-ten Sachgruppen rangordnend zurück. Die grundlegende Annahme dabei ist, dass die gesuchte Sachgruppe unter den oberen Rängen zurückgegeben wird. In einem dreistufigen Verfahren wird die Leistungsfähigkeit des Verfahrens validiert. Hierzu wird zunächst anhand von Metadaten und Erkenntnissen einer Kurzautopsie ein Goldstandard aus Dokumenten erstellt, die im Online-Katalog der DNB abrufbar sind. Die Dokumente vertei-len sich über 14 der Sachgruppen mit einer Losgröße von jeweils 50 Dokumenten. Sämtliche Dokumente werden mit dem Extraktor erschlossen und die Ergebnisse der Kategorisierung do-kumentiert. Schließlich wird die sich daraus ergebende Retrievalleistung sowohl für eine harte (binäre) Kategorisierung als auch eine rangordnende Rückgabe der Sachgruppen beurteilt.

Content

Master thesis Master of Science (Library and Information Studies) (MSc), Universität Wien. Advisor: Christoph Steiner. Vgl.: https://www.researchgate.net/publication/371680244_Vergabe_von_DDC-Sachgruppen_mittels_eines_Schlagwort-Thesaurus. DOI: 10.25365/thesis.70030. Vgl. dazu die Präsentation unter: https://www.google.com/url?sa=i&rct=j&q=&esrc=s&source=web&cd=&ved=0CAIQw7AJahcKEwjwoZzzytz_AhUAAAAAHQAAAAAQAg&url=https%3A%2F%2Fwiki.dnb.de%2Fdownload%2Fattachments%2F252121510%2FDA3%2520Workshop-Gabler.pdf%3Fversion%3D1%26modificationDate%3D1671093170000%26api%3Dv2&psig=AOvVaw0szwENK1or3HevgvIDOfjx&ust=1687719410889597&opi=89978449.

Karg, H.: Mapping Dewey and subject authorities : CrissCross (2007) 0.04

0.03684819 = product of:
  0.055272285 = sum of:
    0.012813049 = weight(_text_:in in 559) [ClassicSimilarity], result of:
      0.012813049 = score(doc=559,freq=2.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.18034597 = fieldWeight in 559, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.09375 = fieldNorm(doc=559)
    0.042459235 = product of:
      0.08491847 = sum of:
        0.08491847 = weight(_text_:22 in 559) [ClassicSimilarity], result of:
          0.08491847 = score(doc=559,freq=2.0), product of:
            0.18290302 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.052230705 = queryNorm
            0.46428138 = fieldWeight in 559, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=559)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Content: Vortrag anläasslich des Workshops: "Extending the multilingual capacity of The European Library in the EDL project Stockholm, Swedish National Library, 22-23 November 2007".

Unesco thesaurus : a structured list of descriptors for indexing and retrieving literature in the fields of education, science, social and human science, culture and communication and information (1995) 0.03

0.033184398 = product of:
  0.049776595 = sum of:
    0.015100324 = weight(_text_:in in 7325) [ClassicSimilarity], result of:
      0.015100324 = score(doc=7325,freq=4.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.21253976 = fieldWeight in 7325, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.078125 = fieldNorm(doc=7325)
    0.034676272 = product of:
      0.069352545 = sum of:
        0.069352545 = weight(_text_:science in 7325) [ClassicSimilarity], result of:
          0.069352545 = score(doc=7325,freq=6.0), product of:
            0.1375819 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.052230705 = queryNorm
            0.5040819 = fieldWeight in 7325, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.078125 = fieldNorm(doc=7325)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Footnote: Rez. in: Indexer 20(1996) no.2, S.108 (A. McCarthy); Journal of librarianship and information science 29(1997) no.3, S.165-166 (A. Gilchrist)

Wilson, T.D.: ¬The work of the British Classification Research Group (1972) 0.03

0.028096542 = product of:
  0.042144813 = sum of:
    0.01812039 = weight(_text_:in in 2766) [ClassicSimilarity], result of:
      0.01812039 = score(doc=2766,freq=4.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.25504774 = fieldWeight in 2766, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.09375 = fieldNorm(doc=2766)
    0.024024425 = product of:
      0.04804885 = sum of:
        0.04804885 = weight(_text_:science in 2766) [ClassicSimilarity], result of:
          0.04804885 = score(doc=2766,freq=2.0), product of:
            0.1375819 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.052230705 = queryNorm
            0.34923816 = fieldWeight in 2766, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.09375 = fieldNorm(doc=2766)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Series: Contributions in librarianship and information science; 3
Source: Subject retrieval in the seventies: new directions. Proc. of an int. symp. ... College Park, 14.-15.5.1971. Ed.: H.H. Wellisch u.a

Unesco thesaurus : a structured list of descriptors for indexing and retrieving literature in the fields of education, science, social science, culture and communication (1977) 0.02

0.020794988 = product of:
  0.03119248 = sum of:
    0.008542033 = weight(_text_:in in 6424) [ClassicSimilarity], result of:
      0.008542033 = score(doc=6424,freq=2.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.120230645 = fieldWeight in 6424, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0625 = fieldNorm(doc=6424)
    0.022650447 = product of:
      0.045300893 = sum of:
        0.045300893 = weight(_text_:science in 6424) [ClassicSimilarity], result of:
          0.045300893 = score(doc=6424,freq=4.0), product of:
            0.1375819 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.052230705 = queryNorm
            0.3292649 = fieldWeight in 6424, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0625 = fieldNorm(doc=6424)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Aitchison, J.: Bliss and the thesaurus : the bibliographic classification of H.E. Bliss as a source of thesaurus terms and structure (1986) 0.02

0.020465266 = product of:
  0.030697897 = sum of:
    0.010677542 = weight(_text_:in in 1570) [ClassicSimilarity], result of:
      0.010677542 = score(doc=1570,freq=2.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.15028831 = fieldWeight in 1570, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.078125 = fieldNorm(doc=1570)
    0.020020355 = product of:
      0.04004071 = sum of:
        0.04004071 = weight(_text_:science in 1570) [ClassicSimilarity], result of:
          0.04004071 = score(doc=1570,freq=2.0), product of:
            0.1375819 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.052230705 = queryNorm
            0.2910318 = fieldWeight in 1570, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.078125 = fieldNorm(doc=1570)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Footnote: Ähnlich auch in: Journal of documentation 42(1986) S.160-181
Source: Ranganathan's philosophy: assessment, impact and relevance. Proc. of the Int. Conf. organised by the Indian Library Association an co-sponsored by Sarada Ranganathan' Endowment for Library Science. Ed.: T.S. Rajagopalan

Hatapuc, A.: De la vocabular controlat la tezaur : schita de proiect pentru domeniul siintelor politice (2003) 0.02

0.020465266 = product of:
  0.030697897 = sum of:
    0.010677542 = weight(_text_:in in 2083) [ClassicSimilarity], result of:
      0.010677542 = score(doc=2083,freq=2.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.15028831 = fieldWeight in 2083, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.078125 = fieldNorm(doc=2083)
    0.020020355 = product of:
      0.04004071 = sum of:
        0.04004071 = weight(_text_:science in 2083) [ClassicSimilarity], result of:
          0.04004071 = score(doc=2083,freq=2.0), product of:
            0.1375819 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.052230705 = queryNorm
            0.2910318 = fieldWeight in 2083, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.078125 = fieldNorm(doc=2083)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Footnote: Übers. des Titels: On controlled vocabulaires: a project in the field of political science -

Tudhope, D.; Binding, C.; Blocks, D.; Cuncliffe, D.: Representation and retrieval in faceted systems (2003) 0.01
```
0.014632022 = product of:
  0.021948032 = sum of:
    0.011937855 = weight(_text_:in in 2703) [ClassicSimilarity], result of:
      0.011937855 = score(doc=2703,freq=10.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.16802745 = fieldWeight in 2703, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2703)
    0.010010177 = product of:
      0.020020355 = sum of:
        0.020020355 = weight(_text_:science in 2703) [ClassicSimilarity], result of:
          0.020020355 = score(doc=2703,freq=2.0), product of:
            0.1375819 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.052230705 = queryNorm
            0.1455159 = fieldWeight in 2703, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2703)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)
```
Abstract

This paper discusses two inter-related themes: the retrieval potential of faceted thesauri and XML representations of fundamental facets. Initial findings are discussed from the ongoing 'FACET' project, in collaboration with the National Museum of Science and Industry. The work discussed seeks to take advantage of the structure afforded by faceted systems for multi-term queries and flexible matching, focusing in this paper an the Art and Architecture Thesaurus. A multi-term matching function yields ranked results with partial matches via semantic term expansion, based an a measure of distance over the semantic index space formed by thesaurus relationships. Our intention is to drive the system from general representations and a common query structure and interface. To this end, we are developing an XML representation based an work by the Classification Research Group an fundamental facets or categories. The XML representation maps categories to particular thesauri and hierarchies. The system interface, which is configured by the mapping, incorporates a thesaurus browser with navigation history together with a term search facility and drag and drop query builder.

Series

Advances in knowledge organization; vol.8

Source

Challenges in knowledge representation and organization for the 21st century: Integration of knowledge across boundaries. Proceedings of the 7th ISKO International Conference Granada, Spain, July 10-13, 2002. Ed.: M. López-Huertas
Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: Compound descriptors in context : a matching function for classifications and thesauri (2002) 0.01
```
0.014632022 = product of:
  0.021948032 = sum of:
    0.011937855 = weight(_text_:in in 3179) [ClassicSimilarity], result of:
      0.011937855 = score(doc=3179,freq=10.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.16802745 = fieldWeight in 3179, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3179)
    0.010010177 = product of:
      0.020020355 = sum of:
        0.020020355 = weight(_text_:science in 3179) [ClassicSimilarity], result of:
          0.020020355 = score(doc=3179,freq=2.0), product of:
            0.1375819 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.052230705 = queryNorm
            0.1455159 = fieldWeight in 3179, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3179)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)
```
Abstract

There are many advantages for Digital Libraries in indexing with classifications or thesauri, but some current disincentive in the lack of flexible retrieval tools that deal with compound descriptors. This paper discusses a matching function for compound descriptors, or multi-concept subject headings, that does not rely an exact matching but incorporates term expansion via thesaurus semantic relationships to produce ranked results that take account of missing and partially matching terms. The matching function is based an a measure of semantic closeness between terms, which has the potential to help with recall problems. The work reported is part of the ongoing FACET project in collaboration with the National Museum of Science and Industry and its collections database. The architecture of the prototype system and its Interface are outlined. The matching problem for compound descriptors is reviewed and the FACET implementation described. Results are discussed from scenarios using the faceted Getty Art and Architecture Thesaurus. We argue that automatic traversal of thesaurus relationships can augment the user's browsing possibilities. The techniques can be applied both to unstructured multi-concept subject headings and potentially to more syntactically structured strings. The notion of a focus term is used by the matching function to model AAT modified descriptors (noun phrases). The relevance of the approach to precoordinated indexing and matching faceted strings is discussed.

Theme

Semantisches Umfeld in Indexierung u. Retrieval

Root thesaurus. Pt.1.2 (1985) 0.01

0.014153078 = product of:
  0.042459235 = sum of:
    0.042459235 = product of:
      0.08491847 = sum of:
        0.08491847 = weight(_text_:22 in 467) [ClassicSimilarity], result of:
          0.08491847 = score(doc=467,freq=2.0), product of:
            0.18290302 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.052230705 = queryNorm
            0.46428138 = fieldWeight in 467, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=467)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 18. 5.2007 14:22:43

Aitchison, J.: ¬A classification as a source for a thesaurus : the bibliographic classification of H.E. Bliss as a source of thesaurus terms and structure (1986) 0.01
```
0.014048271 = product of:
  0.021072406 = sum of:
    0.009060195 = weight(_text_:in in 1569) [ClassicSimilarity], result of:
      0.009060195 = score(doc=1569,freq=4.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.12752387 = fieldWeight in 1569, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=1569)
    0.012012213 = product of:
      0.024024425 = sum of:
        0.024024425 = weight(_text_:science in 1569) [ClassicSimilarity], result of:
          0.024024425 = score(doc=1569,freq=2.0), product of:
            0.1375819 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.052230705 = queryNorm
            0.17461908 = fieldWeight in 1569, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.046875 = fieldNorm(doc=1569)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)
```
Abstract

The second edition of the Bibliographic Classidication of H.E. Bliss (BC2), being prepared under the editorship of Jack Mills, Vanda Broughton and others, is a rich source of structure and terminology for thesauri covering different subject fields. The new edition employs facet analysis and is thesaurus-compatible. A number of facet-based thesauri have drawn upon Bliss for terms and relationships. In two of these thesauri the Bliss Classification was the source of both systematic and alphabetical displays. The DHSS-DATA thesaurus, published by the United Kingdom Department of Health and Social Security, provides controlled terms and Bliss class numbers for indexing and searching the DHSS-DATA database. The ECOT thesaurus (Educational courses and occupations thesaurus) prepared for the Department of Education and Science, uses the software sedigned for the British Standards Institution ROOT thesaurus to genearte an alphabetical display from the systematic display derived from the Bliss schedules. Problems, benefits, and future prospects of Bliss-based thesaurus construction are discussed

Footnote

Ähnlich auch in: Ranganathan's philosophy: assessment, impact and relevance
Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.01
```
0.013392268 = product of:
  0.0200884 = sum of:
    0.01208026 = weight(_text_:in in 175) [ClassicSimilarity], result of:
      0.01208026 = score(doc=175,freq=16.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.17003182 = fieldWeight in 175, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.03125 = fieldNorm(doc=175)
    0.008008142 = product of:
      0.016016284 = sum of:
        0.016016284 = weight(_text_:science in 175) [ClassicSimilarity], result of:
          0.016016284 = score(doc=175,freq=2.0), product of:
            0.1375819 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.052230705 = queryNorm
            0.11641272 = fieldWeight in 175, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.03125 = fieldNorm(doc=175)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)
```
Abstract

There are many advantages for Digital Libraries in indexing with classifications or thesauri, but some current disincentive in the lack of flexible retrieval tools that deal with compound descriptors. This demonstration of a research prototype illustrates a matching function for compound descriptors, or multi-concept subject headings, that does not rely on exact matching but incorporates term expansion via thesaurus semantic relationships to produce ranked results that take account of missing and partially matching terms. The matching function is based on a measure of semantic closeness between terms.The work is part of the EPSRC funded FACET project in collaboration with the UK National Museum of Science and Industry (NMSI) which includes the National Railway Museum. An export of NMSI's Collections Database is used as the dataset for the research. The J. Paul Getty Trust's Art and Architecture Thesaurus (AAT) is the main thesaurus in the project. The AAT is a widely used thesaurus (over 120,000 terms). Descriptors are organised in 7 facets representing separate conceptual classes of terms.The FACET application is a multi tiered architecture accessing a SQL Server database, with an OLE DB connection. The thesauri are stored as relational tables in the Server's database. However, a key component of the system is a parallel representation of the underlying semantic network as an in-memory structure of thesaurus concepts (corresponding to preferred terms). The structure models the hierarchical and associative interrelationships of thesaurus concepts via weighted poly-hierarchical links. Its primary purpose is real-time semantic expansion of query terms, achieved by a spreading activation semantic closeness algorithm. Queries with associated results are stored persistently using XML format data. A Visual Basic interface combines a thesaurus browser and an initial term search facility that takes into account equivalence relationships. Terms are dragged to a direct manipulation Query Builder which maintains the facet structure.

Theme

Semantisches Umfeld in Indexierung u. Retrieval

Vickery, B.C.: Classificatory principles in natural language indexing systems (1976) 0.01

0.0070468183 = product of:
  0.021140454 = sum of:
    0.021140454 = weight(_text_:in in 1276) [ClassicSimilarity], result of:
      0.021140454 = score(doc=1276,freq=4.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.29755569 = fieldWeight in 1276, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.109375 = fieldNorm(doc=1276)
  0.33333334 = coord(1/3)

Source: Classification in the 1970s: a second look. Rev. ed. Ed.: A. Maltby

Green, R.: Making visible hidden relationships in the Dewey Decimal Classification : how relative index terms relate to DDC classes (2008) 0.01
```
0.0065916954 = product of:
  0.019775085 = sum of:
    0.019775085 = weight(_text_:in in 2236) [ClassicSimilarity], result of:
      0.019775085 = score(doc=2236,freq=14.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.27833787 = fieldWeight in 2236, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2236)
  0.33333334 = coord(1/3)
```
Content

Relative Index (RI) terms in the Dewey Decimal Classification (DDC) system correspond to concepts that either approximate the whole of the class they index or that are in standing room there. DDC conventions and shallow natural language processing are used to determine automatically whether specific RI terms approximate the whole of or are in standing room in the classes they index. Approximately three-quarters of all RI terms are processed by the techniques described.

Series

Advances in knowledge organization; vol.11

Source

Culture and identity in knowledge organization: Proceedings of the Tenth International ISKO Conference 5-8 August 2008, Montreal, Canada. Ed. by Clément Arsenault and Joseph T. Tennis
Frâncu, V.: Harmonizing a universal classification system with an interdisciplinary multilingual thesaurus : advantages and limitations (2000) 0.01
```
0.006164682 = product of:
  0.018494045 = sum of:
    0.018494045 = weight(_text_:in in 108) [ClassicSimilarity], result of:
      0.018494045 = score(doc=108,freq=24.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.260307 = fieldWeight in 108, product of:
          4.8989797 = tf(freq=24.0), with freq of:
            24.0 = termFreq=24.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=108)
  0.33333334 = coord(1/3)
```
Abstract

The case under consideration is a project of building an interdisciplinary multilingual thesaurus (Romanian-English-French) starting from a list of indexing terms based on an abridged version of the Universal Decimal Classification (UDC). The resulting thesaurus is intended for public libraries for both indexing and searching purposes in bibliographic databases covering a wide range of topics but with a fairly low level of specificity. The problems encountered in such an approach fall into two groups: 1) concordance or compatibility problems in terms of the indexing languages considered (between a classification system and a thesaurus); 2) equivalence and, hence, translatability problems in terms of the natural languages involved. Additionally, the question of ambiguity given the co-occurrence of terms in more than one class, will be discussed with reference to homographs and polysemantic words. In a thesaurus with such a wide coverage yet with a low specificity level, the method adopted in the thesaurus construction was to provide as many lead-in terms as possible and post them up to the closest in meaning broader term in order to improve the recall ratio

Series

Advances in knowledge organization; vol.7

Source

Dynamism and stability in knowledge organization: Proceedings of the 6th International ISKO-Conference, 10-13 July 2000, Toronto, Canada. Ed.: C. Beghtol et al

Panzer, M.: DDC in Germany : Recent Developments and Current Activities (2007) 0.01

0.00604013 = product of:
  0.01812039 = sum of:
    0.01812039 = weight(_text_:in in 98) [ClassicSimilarity], result of:
      0.01812039 = score(doc=98,freq=4.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.25504774 = fieldWeight in 98, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.09375 = fieldNorm(doc=98)
  0.33333334 = coord(1/3)

Content: Vortrag auf dem ALA Midwinter Meeting im Januar 2007 in Seattle.

Gödert, W.: Komplementarität bei der Klassenbildung in der klassifikatorischen und verbalen Sacherschließung (1985) 0.01

0.00604013 = product of:
  0.01812039 = sum of:
    0.01812039 = weight(_text_:in in 3304) [ClassicSimilarity], result of:
      0.01812039 = score(doc=3304,freq=4.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.25504774 = fieldWeight in 3304, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.09375 = fieldNorm(doc=3304)
  0.33333334 = coord(1/3)

Source: Anwendungen in der Klassifikation. I. Proc. 8. Jahrestagung der Gesellschaft für Klassifikation, Hofgeismar, 10.4.-13.4.1984

Austin, D.: ¬The CRG research into a freely faceted scheme (1976) 0.01

0.0056946888 = product of:
  0.017084066 = sum of:
    0.017084066 = weight(_text_:in in 116) [ClassicSimilarity], result of:
      0.017084066 = score(doc=116,freq=2.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.24046129 = fieldWeight in 116, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.125 = fieldNorm(doc=116)
  0.33333334 = coord(1/3)

Source: Classification in the 1970s. Rev. ed

Negrini, G.: CLASTHES: a thesaurofacet generator (1994) 0.01

0.0053387615 = product of:
  0.016016284 = sum of:
    0.016016284 = product of:
      0.032032568 = sum of:
        0.032032568 = weight(_text_:science in 8347) [ClassicSimilarity], result of:
          0.032032568 = score(doc=8347,freq=2.0), product of:
            0.1375819 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.052230705 = queryNorm
            0.23282544 = fieldWeight in 8347, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0625 = fieldNorm(doc=8347)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Library science with a slant to documentation. 31(1994) no.1, S.1-12

Frâncu, V.: ¬A universal classification system going through changes (2001) 0.01
```
0.0050334414 = product of:
  0.015100324 = sum of:
    0.015100324 = weight(_text_:in in 1593) [ClassicSimilarity], result of:
      0.015100324 = score(doc=1593,freq=16.0), product of:
        0.07104705 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.052230705 = queryNorm
        0.21253976 = fieldWeight in 1593, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1593)
  0.33333334 = coord(1/3)
```
Abstract

In the early 1990s, indexing with classification codes from the Universal Decimal Classification (UDC) in an academic library, going from traditional to automated routines in any and all its activities, suddenly proved insufficient. Under the circumstances of online searching, the possibilities offered by the new OPAC looked much more attractive to indexers and searchers alike. Therefore, a quick shift to indexing with UDC and keywords instead of UDC numbers alone was made. Currency, precision and, more importantly, user-friendliness were strong advantages offered by keyword indexing and searching. But the larger the dictionary of keywords, the more problematic the consequences an information scattering, given the lack of control an terms. The present paper describes the advantages of the UDC in indexing by presenting some of the devices it is provided with: subdivision by analogy, common and special auxiliaries, use of synthesis, and use of connecting symbols. The solution of indexing with both UDC notation and words from a thesaurus based an UDC was prompted by some other characteristics of the schedules: a semi-faceted classification system, hierarchical organisation, richness in terminology and consistency and control of notation. The methodology used in building the thesaurus is conceived according to the international standards (ISO 2788 and 5964) to which some principles have been added, giving the specific approach of harmonising a classification structure with that of a thesaurus. Compatibility and translatability issues are also considered and some problems arising from them are treated in detail. Most of the problems discussed are illustrated with examples.

Source

Advances in classification research, vol.10: proceedings of the 10th ASIS SIG/CR Classification Research Workshop. Ed.: Albrechtsen, H. u. J.E. Mai

Search (39 results, page 1 of 2)

Authors

Years

Languages

Types

Themes

Subjects

Classifications