Search (167 results, page 1 of 9)

Plaunt, C.; Norgard, B.A.: ¬An association-based method for automatic indexing with a controlled vocabulary (1998) 0.06

0.06499904 = sum of:
  0.01585282 = product of:
    0.06341128 = sum of:
      0.06341128 = weight(_text_:authors in 1794) [ClassicSimilarity], result of:
        0.06341128 = score(doc=1794,freq=2.0), product of:
          0.25179064 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.05523161 = queryNorm
          0.25184128 = fieldWeight in 1794, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1794)
    0.25 = coord(1/4)
  0.049146216 = product of:
    0.07371932 = sum of:
      0.03630372 = weight(_text_:c in 1794) [ClassicSimilarity], result of:
        0.03630372 = score(doc=1794,freq=2.0), product of:
          0.1905162 = queryWeight, product of:
            3.4494052 = idf(docFreq=3817, maxDocs=44218)
            0.05523161 = queryNorm
          0.1905545 = fieldWeight in 1794, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.4494052 = idf(docFreq=3817, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1794)
      0.0374156 = weight(_text_:22 in 1794) [ClassicSimilarity], result of:
        0.0374156 = score(doc=1794,freq=2.0), product of:
          0.19341168 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.05523161 = queryNorm
          0.19345059 = fieldWeight in 1794, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1794)
    0.6666667 = coord(2/3)
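
The per-term figures in the explain tree above can be reproduced by hand. The following is a minimal sketch, assuming the usual ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function names are illustrative and are not part of the search system. It recomputes the `authors` term score for doc 1794 from the values printed in the tree.

```python
import math

def classic_idf(doc_freq, max_docs):
    # Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq):
    # Assumed ClassicSimilarity tf: square root of the raw term frequency
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    idf = classic_idf(doc_freq, max_docs)
    query_weight = idf * query_norm                      # queryWeight line in the tree
    field_weight = classic_tf(freq) * idf * field_norm   # fieldWeight line in the tree
    return query_weight * field_weight

# Values copied from the "authors" branch for doc 1794
score = term_score(freq=2.0, doc_freq=1258, max_docs=44218,
                   query_norm=0.05523161, field_norm=0.0390625)
print(score)
```

Small differences in the last digits are to be expected, since the tree shows single-precision values while this sketch uses double precision.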

Abstract: In this article, we describe and test a two-stage algorithm based on a lexical collocation technique which maps from the lexical clues contained in a document representation into a controlled vocabulary list of subject headings. Using a collection of 4,626 INSPEC documents, we create a 'dictionary' of associations between the lexical items contained in the titles, authors, and abstracts, and controlled vocabulary subject headings assigned to those records by human indexers using a likelihood ratio statistic as the measure of association. In the deployment stage, we use the dictionary to predict which of the controlled vocabulary subject headings best describe new documents when they are presented to the system. Our evaluation of this algorithm, in which we compare the automatically assigned subject headings to the subject headings assigned to the test documents by human catalogers, shows that we can obtain results comparable to, and consistent with, human cataloging. In effect we have cast this as a classic partial match information retrieval problem. We consider the problem to be one of 'retrieving' (or assigning) the most probably 'relevant' (or correct) controlled vocabulary subject headings to a document based on the clues contained in that document.
Date: 11. 9.2000 19:53:22
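
The abstract of this record names a likelihood ratio statistic as the measure of association between lexical items and subject headings, but it does not give the formula. The following is a minimal sketch, assuming the common log-likelihood ratio (G²) over a 2x2 co-occurrence table; the function name and the counts are hypothetical, and the statistic actually used in the article may differ.

```python
import math

def log_likelihood_ratio(k11, k12, k21, k22):
    """G^2 for a 2x2 table of co-occurrence counts.

    k11: records with both the word and the heading
    k12: records with the word but not the heading
    k21: records with the heading but not the word
    k22: records with neither
    """
    observed = ((k11, k12), (k21, k22))
    total = k11 + k12 + k21 + k22
    row_sums = (k11 + k12, k21 + k22)
    col_sums = (k11 + k21, k12 + k22)
    g2 = 0.0
    for i in range(2):
        for j in range(2):
            o = observed[i][j]
            if o > 0:  # a zero cell contributes nothing to the sum
                expected = row_sums[i] * col_sums[j] / total
                g2 += o * math.log(o / expected)
    return 2.0 * g2

# Hypothetical counts: word and heading always occur together
strong = log_likelihood_ratio(10, 0, 0, 10)
# Hypothetical counts: word and heading occur independently
independent = log_likelihood_ratio(10, 10, 10, 10)
print(strong, independent)
```

Under this assumption, a higher value indicates a stronger word-to-heading association, so the candidate headings for a new document could be ranked by the summed scores of the words it contains.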

Schneider, C.; Womser-Hacker, C.: Inhaltserschließungssysteme für Patenttexte : Test und Systemvergleich im Projekt PADOK (1986) 0.06

0.05613949 = product of:
  0.11227898 = sum of:
    0.11227898 = product of:
      0.16841847 = sum of:
        0.12321892 = weight(_text_:c in 2648) [ClassicSimilarity], result of:
          0.12321892 = score(doc=2648,freq=4.0), product of:
            0.1905162 = queryWeight, product of:
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.05523161 = queryNorm
            0.64676344 = fieldWeight in 2648, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.09375 = fieldNorm(doc=2648)
        0.04519956 = weight(_text_:h in 2648) [ClassicSimilarity], result of:
          0.04519956 = score(doc=2648,freq=2.0), product of:
            0.13722013 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.05523161 = queryNorm
            0.32939452 = fieldWeight in 2648, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.09375 = fieldNorm(doc=2648)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
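
The nested `coord` factors in the tree above scale each clause group by the fraction of its clauses that matched. A minimal sketch of that combination for doc 2648, using only the numbers printed in the tree (the helper name `coord` is illustrative):

```python
def coord(matched, total):
    # Fraction of query clauses in a group that matched the document
    return matched / total

# Per-term scores copied from the tree for doc 2648
score_c = 0.12321892
score_h = 0.04519956

inner = (score_c + score_h) * coord(2, 3)  # two of three inner clauses matched
total = inner * coord(1, 2)                # one of two outer clauses matched
print(total)
```

The result agrees with the top-level score of the record to within rounding.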

Source: Deutscher Dokumentartag 1986, Freiburg, 8.-10.10.1986: Bedarfsorientierte Fachinformation: Methoden und Techniken am Arbeitsplatz. Bearb.: H. Strohl-Goebel

Fuhr, N.; Niewelt, B.: ¬Ein Retrievaltest mit automatisch indexierten Dokumenten (1984) 0.05

0.052498832 = product of:
  0.104997665 = sum of:
    0.104997665 = product of:
      0.1574965 = sum of:
        0.052732818 = weight(_text_:h in 262) [ClassicSimilarity], result of:
          0.052732818 = score(doc=262,freq=2.0), product of:
            0.13722013 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.05523161 = queryNorm
            0.38429362 = fieldWeight in 262, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.109375 = fieldNorm(doc=262)
        0.10476368 = weight(_text_:22 in 262) [ClassicSimilarity], result of:
          0.10476368 = score(doc=262,freq=2.0), product of:
            0.19341168 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05523161 = queryNorm
            0.5416616 = fieldWeight in 262, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=262)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Date: 20.10.2000 12:22:23
Source: Deutscher Dokumentartag 1983, Göttingen, 3.-7.10.1983: Fachinformation und Bildschirmtext. Bearb.: H. Strohl-Goebel

Schwarz, C.: Komplexe Nominalgruppen als Indexierungseinheiten am Beispiel des Projekte CONDOR (1982) 0.05

0.051461082 = product of:
  0.102922164 = sum of:
    0.102922164 = product of:
      0.15438324 = sum of:
        0.10165042 = weight(_text_:c in 435) [ClassicSimilarity], result of:
          0.10165042 = score(doc=435,freq=2.0), product of:
            0.1905162 = queryWeight, product of:
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.05523161 = queryNorm
            0.5335526 = fieldWeight in 435, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.109375 = fieldNorm(doc=435)
        0.052732818 = weight(_text_:h in 435) [ClassicSimilarity], result of:
          0.052732818 = score(doc=435,freq=2.0), product of:
            0.13722013 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.05523161 = queryNorm
            0.38429362 = fieldWeight in 435, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.109375 = fieldNorm(doc=435)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: Deutscher Dokumentartag 1981, Mainz, 5.-8.10.1981: Kleincomputer in Information und Dokumentation. Bearb.: H. Strohl-Goebel

Fuhr, N.: Ranking-Experimente mit gewichteter Indexierung (1986) 0.04

0.044999 = product of:
  0.089998 = sum of:
    0.089998 = product of:
      0.134997 = sum of:
        0.04519956 = weight(_text_:h in 58) [ClassicSimilarity], result of:
          0.04519956 = score(doc=58,freq=2.0), product of:
            0.13722013 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.05523161 = queryNorm
            0.32939452 = fieldWeight in 58, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.09375 = fieldNorm(doc=58)
        0.08979744 = weight(_text_:22 in 58) [ClassicSimilarity], result of:
          0.08979744 = score(doc=58,freq=2.0), product of:
            0.19341168 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05523161 = queryNorm
            0.46428138 = fieldWeight in 58, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=58)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Date: 14. 6.2015 22:12:44
Source: Deutscher Dokumentartag 1985, Nürnberg, 1.-4.10.1985: Fachinformation: Methodik - Management - Markt; neue Entwicklungen, Berufe, Produkte. Bearb.: H. Strohl-Goebel

Hauer, M.: Tiefenindexierung im Bibliothekskatalog : 17 Jahre intelligentCAPTURE (2019) 0.04

0.044999 = product of:
  0.089998 = sum of:
    0.089998 = product of:
      0.134997 = sum of:
        0.04519956 = weight(_text_:h in 5629) [ClassicSimilarity], result of:
          0.04519956 = score(doc=5629,freq=2.0), product of:
            0.13722013 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.05523161 = queryNorm
            0.32939452 = fieldWeight in 5629, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.09375 = fieldNorm(doc=5629)
        0.08979744 = weight(_text_:22 in 5629) [ClassicSimilarity], result of:
          0.08979744 = score(doc=5629,freq=2.0), product of:
            0.19341168 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05523161 = queryNorm
            0.46428138 = fieldWeight in 5629, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=5629)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: B.I.T.online. 22(2019) H.2, S.163-166

Oliver, C.: Leveraging KOS to extend our reach with automated processes (2021) 0.04

0.04472649 = sum of:
  0.025364509 = product of:
    0.101458035 = sum of:
      0.101458035 = weight(_text_:authors in 722) [ClassicSimilarity], result of:
        0.101458035 = score(doc=722,freq=2.0), product of:
          0.25179064 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.05523161 = queryNorm
          0.40294603 = fieldWeight in 722, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.0625 = fieldNorm(doc=722)
    0.25 = coord(1/4)
  0.019361984 = product of:
    0.058085952 = sum of:
      0.058085952 = weight(_text_:c in 722) [ClassicSimilarity], result of:
        0.058085952 = score(doc=722,freq=2.0), product of:
          0.1905162 = queryWeight, product of:
            3.4494052 = idf(docFreq=3817, maxDocs=44218)
            0.05523161 = queryNorm
          0.3048872 = fieldWeight in 722, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.4494052 = idf(docFreq=3817, maxDocs=44218)
            0.0625 = fieldNorm(doc=722)
    0.33333334 = coord(1/3)

Abstract: This article provides a conclusion to the special issue on Artificial Intelligence (AI) and Automated Processes for Subject Access. The authors who contributed to this special issue have provoked interesting questions as well as bringing attention to important issues. This concluding article looks at common themes and highlights some of the questions raised.

Martins, A.L.; Souza, R.R.; Ribeiro de Mello, H.: ¬The use of noun phrases in information retrieval : proposing a mechanism for automatic classification (2014) 0.04
```
0.043035988 = product of:
  0.086071976 = sum of:
    0.086071976 = sum of:
      0.041072972 = weight(_text_:c in 1441) [ClassicSimilarity], result of:
        0.041072972 = score(doc=1441,freq=4.0), product of:
          0.1905162 = queryWeight, product of:
            3.4494052 = idf(docFreq=3817, maxDocs=44218)
            0.05523161 = queryNorm
          0.21558782 = fieldWeight in 1441, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            3.4494052 = idf(docFreq=3817, maxDocs=44218)
            0.03125 = fieldNorm(doc=1441)
      0.01506652 = weight(_text_:h in 1441) [ClassicSimilarity], result of:
        0.01506652 = score(doc=1441,freq=2.0), product of:
          0.13722013 = queryWeight, product of:
            2.4844491 = idf(docFreq=10020, maxDocs=44218)
            0.05523161 = queryNorm
          0.10979818 = fieldWeight in 1441, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.4844491 = idf(docFreq=10020, maxDocs=44218)
            0.03125 = fieldNorm(doc=1441)
      0.029932482 = weight(_text_:22 in 1441) [ClassicSimilarity], result of:
        0.029932482 = score(doc=1441,freq=2.0), product of:
          0.19341168 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.05523161 = queryNorm
          0.15476047 = fieldWeight in 1441, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.03125 = fieldNorm(doc=1441)
  0.5 = coord(1/2)
```

Abstract: This paper presents research on syntactic structures known as noun phrases (NP) being applied to increase the effectiveness and efficiency of the mechanisms for the document's classification. Our hypothesis is that the NP can be used instead of single words as a semantic aggregator to reduce the number of words that will be used for the classification system without losing its semantic coverage, increasing its efficiency. The experiment divided the document classification process into three phases: a) NP preprocessing; b) system training; and c) classification experiments. In the first step, a corpus of digitalized texts was submitted to a natural language processing platform in which the part-of-speech tagging was done, and then PERL scripts pertaining to the PALAVRAS package were used to extract the Noun Phrases. The preprocessing also involved the tasks of a) removing NP low meaning pre-modifiers, such as quantifiers; b) identification of synonyms and corresponding substitution for common hyperonyms; and c) stemming of the relevant words contained in the NP, for similitude checking with other NPs. The first tests with the resulting documents have demonstrated its effectiveness. We have compared the structural similarity of the documents before and after the whole pre-processing steps of phase one. The texts maintained the consistency with the original and have kept the readability. The second phase involves submitting the modified documents to a SVM algorithm to identify clusters and classify the documents. The classification rules are to be established using a machine learning approach. Finally, tests will be conducted to check the effectiveness of the whole process.
Source: Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik

Chou, C.; Chu, T.: ¬An analysis of BERT (NLP) for assisted subject indexing for Project Gutenberg (2022) 0.04

0.039135683 = sum of:
  0.022193946 = product of:
    0.088775784 = sum of:
      0.088775784 = weight(_text_:authors in 1139) [ClassicSimilarity], result of:
        0.088775784 = score(doc=1139,freq=2.0), product of:
          0.25179064 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.05523161 = queryNorm
          0.35257778 = fieldWeight in 1139, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.0546875 = fieldNorm(doc=1139)
    0.25 = coord(1/4)
  0.016941737 = product of:
    0.05082521 = sum of:
      0.05082521 = weight(_text_:c in 1139) [ClassicSimilarity], result of:
        0.05082521 = score(doc=1139,freq=2.0), product of:
          0.1905162 = queryWeight, product of:
            3.4494052 = idf(docFreq=3817, maxDocs=44218)
            0.05523161 = queryNorm
          0.2667763 = fieldWeight in 1139, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.4494052 = idf(docFreq=3817, maxDocs=44218)
            0.0546875 = fieldNorm(doc=1139)
    0.33333334 = coord(1/3)

Abstract: In light of AI (Artificial Intelligence) and NLP (Natural language processing) technologies, this article examines the feasibility of using AI/NLP models to enhance the subject indexing of digital resources. While BERT (Bidirectional Encoder Representations from Transformers) models are widely used in scholarly communities, the authors assess whether BERT models can be used in machine-assisted indexing in the Project Gutenberg collection, through suggesting Library of Congress subject headings filtered by certain Library of Congress Classification subclass labels. The findings of this study are informative for further research on BERT models to assist with automatic subject indexing for digital library collections.

Greiner-Petter, A.; Schubotz, M.; Cohl, H.S.; Gipp, B.: Semantic preserving bijective mappings for expressions involving special functions between computer algebra systems and document preparation systems (2019) 0.03
```
0.031943806 = sum of:
  0.021966312 = product of:
    0.08786525 = sum of:
      0.08786525 = weight(_text_:authors in 5499) [ClassicSimilarity], result of:
        0.08786525 = score(doc=5499,freq=6.0), product of:
          0.25179064 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.05523161 = queryNorm
          0.34896153 = fieldWeight in 5499, product of:
            2.4494898 = tf(freq=6.0), with freq of:
              6.0 = termFreq=6.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.03125 = fieldNorm(doc=5499)
    0.25 = coord(1/4)
  0.009977494 = product of:
    0.029932482 = sum of:
      0.029932482 = weight(_text_:22 in 5499) [ClassicSimilarity], result of:
        0.029932482 = score(doc=5499,freq=2.0), product of:
          0.19341168 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.05523161 = queryNorm
          0.15476047 = fieldWeight in 5499, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.03125 = fieldNorm(doc=5499)
    0.33333334 = coord(1/3)
```

Abstract: Purpose: Modern mathematicians and scientists of math-related disciplines often use Document Preparation Systems (DPS) to write and Computer Algebra Systems (CAS) to calculate mathematical expressions. Usually, they translate the expressions manually between DPS and CAS. This process is time-consuming and error-prone. The purpose of this paper is to automate this translation. This paper uses Maple and Mathematica as the CAS, and LaTeX as the DPS. Design/methodology/approach: Bruce Miller at the National Institute of Standards and Technology (NIST) developed a collection of special LaTeX macros that create links from mathematical symbols to their definitions in the NIST Digital Library of Mathematical Functions (DLMF). The authors are using these macros to perform rule-based translations between the formulae in the DLMF and CAS. Moreover, the authors develop software to ease the creation of new rules and to discover inconsistencies. Findings: The authors created 396 mappings and translated 58.8 percent of DLMF formulae (2,405 expressions) successfully between Maple and DLMF. For a significant percentage, the special function definitions in Maple and the DLMF were different. An atomic symbol in one system maps to a composite expression in the other system. The translator was also successfully used for automatic verification of mathematical online compendia and CAS. The evaluation techniques discovered two errors in the DLMF and one defect in Maple. Originality/value: This paper introduces the first translation tool for special functions between LaTeX and CAS. The approach improves error-prone manual translations and can be used to verify mathematical online compendia and CAS.
Date: 20. 1.2015 18:30:22

Lepsky, K.; Vorhauer, J.: Lingo - ein open source System für die Automatische Indexierung deutschsprachiger Dokumente (2006) 0.03

0.029999336 = product of:
  0.059998672 = sum of:
    0.059998672 = product of:
      0.08999801 = sum of:
        0.03013304 = weight(_text_:h in 3581) [ClassicSimilarity], result of:
          0.03013304 = score(doc=3581,freq=2.0), product of:
            0.13722013 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.05523161 = queryNorm
            0.21959636 = fieldWeight in 3581, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0625 = fieldNorm(doc=3581)
        0.059864964 = weight(_text_:22 in 3581) [ClassicSimilarity], result of:
          0.059864964 = score(doc=3581,freq=2.0), product of:
            0.19341168 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05523161 = queryNorm
            0.30952093 = fieldWeight in 3581, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=3581)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Date: 24. 3.2006 12:22:02
Source: ABI-Technik. 26(2006) H.1, S.18-28

Probst, M.; Mittelbach, J.: Maschinelle Indexierung in der Sacherschließung wissenschaftlicher Bibliotheken (2006) 0.03

0.029999336 = product of:
  0.059998672 = sum of:
    0.059998672 = product of:
      0.08999801 = sum of:
        0.03013304 = weight(_text_:h in 1755) [ClassicSimilarity], result of:
          0.03013304 = score(doc=1755,freq=2.0), product of:
            0.13722013 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.05523161 = queryNorm
            0.21959636 = fieldWeight in 1755, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0625 = fieldNorm(doc=1755)
        0.059864964 = weight(_text_:22 in 1755) [ClassicSimilarity], result of:
          0.059864964 = score(doc=1755,freq=2.0), product of:
            0.19341168 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05523161 = queryNorm
            0.30952093 = fieldWeight in 1755, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=1755)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Date: 22. 3.2008 12:35:19
Source: Bibliothek: Forschung und Praxis. 30(2006) H.2, S.168-176

Abdul, H.; Khoo, C.: Automatic indexing of medical literature using phrase matching : an exploratory study 0.03

0.029406331 = product of:
  0.058812663 = sum of:
    0.058812663 = product of:
      0.088218994 = sum of:
        0.058085952 = weight(_text_:c in 3601) [ClassicSimilarity], result of:
          0.058085952 = score(doc=3601,freq=2.0), product of:
            0.1905162 = queryWeight, product of:
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.05523161 = queryNorm
            0.3048872 = fieldWeight in 3601, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.0625 = fieldNorm(doc=3601)
        0.03013304 = weight(_text_:h in 3601) [ClassicSimilarity], result of:
          0.03013304 = score(doc=3601,freq=2.0), product of:
            0.13722013 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.05523161 = queryNorm
            0.21959636 = fieldWeight in 3601, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0625 = fieldNorm(doc=3601)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Krause, J.; Womser-Hacker, C.: PADOK-II : Retrievaltests zur Bewertung von Volltextindexierungsvarianten für das deutsche Patentinformationssystem (1990) 0.03

0.029406331 = product of:
  0.058812663 = sum of:
    0.058812663 = product of:
      0.088218994 = sum of:
        0.058085952 = weight(_text_:c in 2653) [ClassicSimilarity], result of:
          0.058085952 = score(doc=2653,freq=2.0), product of:
            0.1905162 = queryWeight, product of:
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.05523161 = queryNorm
            0.3048872 = fieldWeight in 2653, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.0625 = fieldNorm(doc=2653)
        0.03013304 = weight(_text_:h in 2653) [ClassicSimilarity], result of:
          0.03013304 = score(doc=2653,freq=2.0), product of:
            0.13722013 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.05523161 = queryNorm
            0.21959636 = fieldWeight in 2653, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0625 = fieldNorm(doc=2653)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: Nachrichten für Dokumentation. 41(1990) H.1, S.13-19

Schöning-Walter, C.: Automatische Erschließungsverfahren für Netzpublikationen : zum Stand der Arbeiten im Projekt PETRUS (2011) 0.03

0.029406331 = product of:
  0.058812663 = sum of:
    0.058812663 = product of:
      0.088218994 = sum of:
        0.058085952 = weight(_text_:c in 1714) [ClassicSimilarity], result of:
          0.058085952 = score(doc=1714,freq=2.0), product of:
            0.1905162 = queryWeight, product of:
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.05523161 = queryNorm
            0.3048872 = fieldWeight in 1714, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.0625 = fieldNorm(doc=1714)
        0.03013304 = weight(_text_:h in 1714) [ClassicSimilarity], result of:
          0.03013304 = score(doc=1714,freq=2.0), product of:
            0.13722013 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.05523161 = queryNorm
            0.21959636 = fieldWeight in 1714, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0625 = fieldNorm(doc=1714)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: Dialog mit Bibliotheken. 23(2011) H.1, S.31-36

Toepfer, M.; Seifert, C.: Content-based quality estimation for automatic subject indexing of short texts under precision and recall constraints 0.03

0.02795406 = sum of:
  0.01585282 = product of:
    0.06341128 = sum of:
      0.06341128 = weight(_text_:authors in 4309) [ClassicSimilarity], result of:
        0.06341128 = score(doc=4309,freq=2.0), product of:
          0.25179064 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.05523161 = queryNorm
          0.25184128 = fieldWeight in 4309, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.0390625 = fieldNorm(doc=4309)
    0.25 = coord(1/4)
  0.01210124 = product of:
    0.03630372 = sum of:
      0.03630372 = weight(_text_:c in 4309) [ClassicSimilarity], result of:
        0.03630372 = score(doc=4309,freq=2.0), product of:
          0.1905162 = queryWeight, product of:
            3.4494052 = idf(docFreq=3817, maxDocs=44218)
            0.05523161 = queryNorm
          0.1905545 = fieldWeight in 4309, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.4494052 = idf(docFreq=3817, maxDocs=44218)
            0.0390625 = fieldNorm(doc=4309)
    0.33333334 = coord(1/3)

Content: This is an authors' manuscript version of a paper accepted for proceedings of TPDL-2018, Porto, Portugal, Sept 10-13. The final authenticated publication is available online at https://doi.org/will be added as soon as available.

Renz, M.: Automatische Inhaltserschließung im Zeichen von Wissensmanagement (2001) 0.03

0.026249416 = product of:
  0.052498832 = sum of:
    0.052498832 = product of:
      0.07874825 = sum of:
        0.026366409 = weight(_text_:h in 5671) [ClassicSimilarity], result of:
          0.026366409 = score(doc=5671,freq=2.0), product of:
            0.13722013 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.05523161 = queryNorm
            0.19214681 = fieldWeight in 5671, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5671)
        0.05238184 = weight(_text_:22 in 5671) [ClassicSimilarity], result of:
          0.05238184 = score(doc=5671,freq=2.0), product of:
            0.19341168 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05523161 = queryNorm
            0.2708308 = fieldWeight in 5671, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5671)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Date: 22. 3.2001 13:14:48
Source: nfd Information - Wissenschaft und Praxis. 52(2001) H.2, S.69-78

Kasprzik, A.: Voraussetzungen und Anwendungspotentiale einer präzisen Sacherschließung aus Sicht der Wissenschaft (2018) 0.03

0.026249416 = product of:
  0.052498832 = sum of:
    0.052498832 = product of:
      0.07874825 = sum of:
        0.026366409 = weight(_text_:h in 5195) [ClassicSimilarity], result of:
          0.026366409 = score(doc=5195,freq=2.0), product of:
            0.13722013 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.05523161 = queryNorm
            0.19214681 = fieldWeight in 5195, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5195)
        0.05238184 = weight(_text_:22 in 5195) [ClassicSimilarity], result of:
          0.05238184 = score(doc=5195,freq=2.0), product of:
            0.19341168 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05523161 = queryNorm
            0.2708308 = fieldWeight in 5195, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5195)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Abstract: Considerable attention is currently focused on the potential of automated methods in subject indexing and on how they can interact with intellectual methods. In this context, this article addresses the following questions: What requirements does the research community place on library metadata? What is needed to serve the information needs of the subject communities? And what does this imply for automating the creation and maintenance of metadata? This article summarizes the position the author took in a short stimulus talk and the panel discussion at the workshop of the FAG "Erschließung und Informationsvermittlung" of the GBV. The workshop took place as part of the 22nd Verbundkonferenz of the GBV.
Source: ABI-Technik. 38(2018) H.4, S.332-335
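
The score explanation for doc 5195 above can be retraced by hand. The following is a minimal sketch, assuming the tf and idf formulas that reproduce the printed values (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))); queryNorm, fieldNorm, and the coord factors are taken from the listing as given. The function and variable names are hypothetical and not part of the search system.

```python
import math

def idf(doc_freq, max_docs):
    # Inverse document frequency; this form reproduces the idf values in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_weight(freq, doc_freq, max_docs, query_norm, field_norm):
    # weight = queryWeight * fieldWeight, as in the explain tree.
    tf = math.sqrt(freq)                         # 1.4142135 for freq=2.0
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm         # idf * queryNorm
    field_weight = tf * term_idf * field_norm    # tf * idf * fieldNorm
    return query_weight * field_weight

# Values copied from the explain tree for doc 5195.
MAX_DOCS = 44218
QUERY_NORM = 0.05523161
FIELD_NORM = 0.0546875

w_h = term_weight(2.0, 10020, MAX_DOCS, QUERY_NORM, FIELD_NORM)   # term "h"
w_22 = term_weight(2.0, 3622, MAX_DOCS, QUERY_NORM, FIELD_NORM)   # term "22"

# Sum of the matching terms, scaled by coord(2/3) and then coord(1/2).
score = (w_h + w_22) * (2.0 / 3.0) * (1.0 / 2.0)
print(round(score, 6))
```

Because the search engine computes in single precision, the result agrees with the listed 0.026249416 only up to small rounding differences.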

Franke-Maier, M.: Anforderungen an die Qualität der Inhaltserschließung im Spannungsfeld von intellektuell und automatisch erzeugten Metadaten (2018) 0.03

0.026249416 = product of:
  0.052498832 = sum of:
    0.052498832 = product of:
      0.07874825 = sum of:
        0.026366409 = weight(_text_:h in 5344) [ClassicSimilarity], result of:
          0.026366409 = score(doc=5344,freq=2.0), product of:
            0.13722013 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.05523161 = queryNorm
            0.19214681 = fieldWeight in 5344, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5344)
        0.05238184 = weight(_text_:22 in 5344) [ClassicSimilarity], result of:
          0.05238184 = score(doc=5344,freq=2.0), product of:
            0.19341168 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05523161 = queryNorm
            0.2708308 = fieldWeight in 5344, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5344)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Abstract: Since the Deutscher Bibliothekartag 2018 at the latest, the discussion about the automatic subject indexing procedures of the Deutsche Nationalbibliothek has shifted from a politically driven debate to a debate about quality. This article examines questions of subject indexing quality in the digital age, where heterogeneous outputs of different procedures meet, and attempts to define important quality requirements. This conference contribution summarizes the ideas the author presented as discussion prompts at the workshop of the FAG "Erschließung und Informationsvermittlung" of the GBV on 29 August 2018 in Kiel. The workshop took place as part of the 22nd Verbundkonferenz of the GBV.
Source: ABI-Technik. 38(2018) H.4, S.327-331

Mesquita, L.A.P.; Souza, R.R.; Baracho Porto, R.M.A.: Noun phrases in automatic indexing: a structural analysis of the distribution of relevant terms in doctoral theses (2014) 0.02
```
0.022659749 = sum of:
  0.012682254 = product of:
    0.050729018 = sum of:
      0.050729018 = weight(_text_:authors in 1442) [ClassicSimilarity], result of:
        0.050729018 = score(doc=1442,freq=2.0), product of:
          0.25179064 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.05523161 = queryNorm
          0.20147301 = fieldWeight in 1442, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.03125 = fieldNorm(doc=1442)
    0.25 = coord(1/4)
  0.009977494 = product of:
    0.029932482 = sum of:
      0.029932482 = weight(_text_:22 in 1442) [ClassicSimilarity], result of:
        0.029932482 = score(doc=1442,freq=2.0), product of:
          0.19341168 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.05523161 = queryNorm
          0.15476047 = fieldWeight in 1442, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.03125 = fieldNorm(doc=1442)
    0.33333334 = coord(1/3)
```
Abstract: The main objective of this research was to analyze whether relevant terms follow a characteristic distribution across a scientific text that could serve as a criterion for automatic indexing. The only terms considered in this study were full noun phrases contained in the texts themselves. The corpus comprised 98 doctoral theses from the eight areas of knowledge at a single university. Initially, 20 full noun phrases were automatically extracted from each text as candidates for the most relevant terms, and the author of each text assigned a relevance value from 0 (not relevant) to 6 (highly relevant) to each of the 20 noun phrases. Only 22.1% of the noun phrases were considered not relevant. The relevance values assigned by the authors were associated with the terms' positions in the text. Each full noun phrase found in the text was counted as a valid linear position. The results show the values of this distribution for two types of position: linear, with values consolidated into ten equal consecutive parts; and structural, considering parts of the text (such as introduction, development and conclusion). A notable result is that all areas of knowledge related to the Natural Sciences showed one characteristic distribution of relevant terms, while all areas related to the Social Sciences shared a characteristic distribution of their own, distinct from that of the Natural Sciences. The difference between the Natural and Social Sciences can be clearly visualized through graphs. All behaviors, including the general behavior of all areas of knowledge together, were characterized by polynomial equations and can be applied in the future as criteria for automatic indexing. To date, this work is original for two reasons: it presents a method for characterizing the distribution of relevant terms in a scientific text and, through this method, points out a quantitative difference between the Natural and Social Sciences.
Source: Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
