-
Schneider, C.; Womser-Hacker, C.: Inhaltserschließungssysteme für Patenttexte : Test und Systemvergleich im Projekt PADOK (1986)
0.05
0.04600405 = product of:
0.0920081 = sum of:
0.0920081 = product of:
0.13801214 = sum of:
0.100972936 = weight(_text_:c in 2648) [ClassicSimilarity], result of:
0.100972936 = score(doc=2648,freq=4.0), product of:
0.15612034 = queryWeight, product of:
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.045260075 = queryNorm
0.64676344 = fieldWeight in 2648, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.09375 = fieldNorm(doc=2648)
0.037039213 = weight(_text_:h in 2648) [ClassicSimilarity], result of:
0.037039213 = score(doc=2648,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.32939452 = fieldWeight in 2648, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.09375 = fieldNorm(doc=2648)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Source
- Deutscher Dokumentartag 1986, Freiburg, 8.-10.10.1986: Bedarfsorientierte Fachinformation: Methoden und Techniken am Arbeitsplatz. Bearb.: H. Strohl-Goebel
-
Fuhr, N.; Niewelt, B.: ¬Ein Retrievaltest mit automatisch indexierten Dokumenten (1984)
0.04
0.043020677 = product of:
0.08604135 = sum of:
0.08604135 = product of:
0.12906203 = sum of:
0.043212414 = weight(_text_:h in 262) [ClassicSimilarity], result of:
0.043212414 = score(doc=262,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.38429362 = fieldWeight in 262, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.109375 = fieldNorm(doc=262)
0.08584961 = weight(_text_:22 in 262) [ClassicSimilarity], result of:
0.08584961 = score(doc=262,freq=2.0), product of:
0.15849307 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.045260075 = queryNorm
0.5416616 = fieldWeight in 262, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.109375 = fieldNorm(doc=262)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Date
- 20.10.2000 12:22:23
- Source
- Deutscher Dokumentartag 1983, Göttingen, 3.-7.10.1983: Fachinformation und Bildschirmtext. Bearb.: H. Strohl-Goebel
-
Schwarz, C.: Komplexe Nominalgruppen als Indexierungseinheiten am Beispiel des Projekte CONDOR (1982)
0.04
0.04217028 = product of:
0.08434056 = sum of:
0.08434056 = product of:
0.12651083 = sum of:
0.083298415 = weight(_text_:c in 435) [ClassicSimilarity], result of:
0.083298415 = score(doc=435,freq=2.0), product of:
0.15612034 = queryWeight, product of:
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.045260075 = queryNorm
0.5335526 = fieldWeight in 435, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.109375 = fieldNorm(doc=435)
0.043212414 = weight(_text_:h in 435) [ClassicSimilarity], result of:
0.043212414 = score(doc=435,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.38429362 = fieldWeight in 435, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.109375 = fieldNorm(doc=435)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Source
- Deutscher Dokumentartag 1981, Mainz, 5.-8.10.1981: Kleincomputer in Information und Dokumentation. Bearb.: H. Strohl-Goebel
-
Fuhr, N.: Ranking-Experimente mit gewichteter Indexierung (1986)
0.04
0.036874868 = product of:
0.073749736 = sum of:
0.073749736 = product of:
0.1106246 = sum of:
0.037039213 = weight(_text_:h in 58) [ClassicSimilarity], result of:
0.037039213 = score(doc=58,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.32939452 = fieldWeight in 58, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.09375 = fieldNorm(doc=58)
0.07358538 = weight(_text_:22 in 58) [ClassicSimilarity], result of:
0.07358538 = score(doc=58,freq=2.0), product of:
0.15849307 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.045260075 = queryNorm
0.46428138 = fieldWeight in 58, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.09375 = fieldNorm(doc=58)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Date
- 14. 6.2015 22:12:44
- Source
- Deutscher Dokumentartag 1985, Nürnberg, 1.-4.10.1985: Fachinformation: Methodik - Management - Markt; neue Entwicklungen, Berufe, Produkte. Bearb.: H. Strohl-Goebel
-
Hauer, M.: Tiefenindexierung im Bibliothekskatalog : 17 Jahre intelligentCAPTURE (2019)
0.04
0.036874868 = product of:
0.073749736 = sum of:
0.073749736 = product of:
0.1106246 = sum of:
0.037039213 = weight(_text_:h in 5629) [ClassicSimilarity], result of:
0.037039213 = score(doc=5629,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.32939452 = fieldWeight in 5629, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.09375 = fieldNorm(doc=5629)
0.07358538 = weight(_text_:22 in 5629) [ClassicSimilarity], result of:
0.07358538 = score(doc=5629,freq=2.0), product of:
0.15849307 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.045260075 = queryNorm
0.46428138 = fieldWeight in 5629, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.09375 = fieldNorm(doc=5629)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Source
- B.I.T.online. 22(2019) H.2, S.163-166
-
Martins, A.L.; Souza, R.R.; Ribeiro de Mello, H.: ¬The use of noun phrases in information retrieval : proposing a mechanism for automatic classification (2014)
0.04
0.035266254 = product of:
0.07053251 = sum of:
0.07053251 = sum of:
0.033657644 = weight(_text_:c in 1441) [ClassicSimilarity], result of:
0.033657644 = score(doc=1441,freq=4.0), product of:
0.15612034 = queryWeight, product of:
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.045260075 = queryNorm
0.21558782 = fieldWeight in 1441, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.03125 = fieldNorm(doc=1441)
0.012346405 = weight(_text_:h in 1441) [ClassicSimilarity], result of:
0.012346405 = score(doc=1441,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.10979818 = fieldWeight in 1441, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.03125 = fieldNorm(doc=1441)
0.02452846 = weight(_text_:22 in 1441) [ClassicSimilarity], result of:
0.02452846 = score(doc=1441,freq=2.0), product of:
0.15849307 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.045260075 = queryNorm
0.15476047 = fieldWeight in 1441, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.03125 = fieldNorm(doc=1441)
0.5 = coord(1/2)
- Abstract
- This paper presents a research on syntactic structures known as noun phrases (NP) being applied to increase the effectiveness and efficiency of the mechanisms for the document's classification. Our hypothesis is the fact that the NP can be used instead of single words as a semantic aggregator to reduce the number of words that will be used for the classification system without losing its semantic coverage, increasing its efficiency. The experiment divided the documents classification process in three phases: a) NP preprocessing b) system training; and c) classification experiments. In the first step, a corpus of digitalized texts was submitted to a natural language processing platform1 in which the part-of-speech tagging was done, and them PERL scripts pertaining to the PALAVRAS package were used to extract the Noun Phrases. The preprocessing also involved the tasks of a) removing NP low meaning pre-modifiers, as quantifiers; b) identification of synonyms and corresponding substitution for common hyperonyms; and c) stemming of the relevant words contained in the NP, for similitude checking with other NPs. The first tests with the resulting documents have demonstrated its effectiveness. We have compared the structural similarity of the documents before and after the whole pre-processing steps of phase one. The texts maintained the consistency with the original and have kept the readability. The second phase involves submitting the modified documents to a SVM algorithm to identify clusters and classify the documents. The classification rules are to be established using a machine learning approach. Finally, tests will be conducted to check the effectiveness of the whole process.
- Source
- Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
-
Lepsky, K.; Vorhauer, J.: Lingo - ein open source System für die Automatische Indexierung deutschsprachiger Dokumente (2006)
0.02
0.024583243 = product of:
0.049166486 = sum of:
0.049166486 = product of:
0.07374973 = sum of:
0.02469281 = weight(_text_:h in 3581) [ClassicSimilarity], result of:
0.02469281 = score(doc=3581,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.21959636 = fieldWeight in 3581, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0625 = fieldNorm(doc=3581)
0.04905692 = weight(_text_:22 in 3581) [ClassicSimilarity], result of:
0.04905692 = score(doc=3581,freq=2.0), product of:
0.15849307 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.045260075 = queryNorm
0.30952093 = fieldWeight in 3581, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0625 = fieldNorm(doc=3581)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Date
- 24. 3.2006 12:22:02
- Source
- ABI-Technik. 26(2006) H.1, S.18-28
-
Probst, M.; Mittelbach, J.: Maschinelle Indexierung in der Sacherschließung wissenschaftlicher Bibliotheken (2006)
0.02
0.024583243 = product of:
0.049166486 = sum of:
0.049166486 = product of:
0.07374973 = sum of:
0.02469281 = weight(_text_:h in 1755) [ClassicSimilarity], result of:
0.02469281 = score(doc=1755,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.21959636 = fieldWeight in 1755, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0625 = fieldNorm(doc=1755)
0.04905692 = weight(_text_:22 in 1755) [ClassicSimilarity], result of:
0.04905692 = score(doc=1755,freq=2.0), product of:
0.15849307 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.045260075 = queryNorm
0.30952093 = fieldWeight in 1755, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0625 = fieldNorm(doc=1755)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Date
- 22. 3.2008 12:35:19
- Source
- Bibliothek: Forschung und Praxis. 30(2006) H.2, S.168-176
-
Abdul, H.; Khoo, C.: Automatic indexing of medical literature using phrase matching : an exploratory study
0.02
0.024097301 = product of:
0.048194602 = sum of:
0.048194602 = product of:
0.0722919 = sum of:
0.047599096 = weight(_text_:c in 3601) [ClassicSimilarity], result of:
0.047599096 = score(doc=3601,freq=2.0), product of:
0.15612034 = queryWeight, product of:
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.045260075 = queryNorm
0.3048872 = fieldWeight in 3601, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.0625 = fieldNorm(doc=3601)
0.02469281 = weight(_text_:h in 3601) [ClassicSimilarity], result of:
0.02469281 = score(doc=3601,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.21959636 = fieldWeight in 3601, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0625 = fieldNorm(doc=3601)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
-
Krause, J.; Womser-Hacker, C.: PADOK-II : Retrievaltests zur Bewertung von Volltextindexierungsvarianten für das deutsche Patentinformationssystem (1990)
0.02
0.024097301 = product of:
0.048194602 = sum of:
0.048194602 = product of:
0.0722919 = sum of:
0.047599096 = weight(_text_:c in 2653) [ClassicSimilarity], result of:
0.047599096 = score(doc=2653,freq=2.0), product of:
0.15612034 = queryWeight, product of:
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.045260075 = queryNorm
0.3048872 = fieldWeight in 2653, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.0625 = fieldNorm(doc=2653)
0.02469281 = weight(_text_:h in 2653) [ClassicSimilarity], result of:
0.02469281 = score(doc=2653,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.21959636 = fieldWeight in 2653, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0625 = fieldNorm(doc=2653)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Source
- Nachrichten für Dokumentation. 41(1990) H.1, S.13-19
-
Schöning-Walter, C.: Automatische Erschließungsverfahren für Netzpublikationen : zum Stand der Arbeiten im Projekt PETRUS (2011)
0.02
0.024097301 = product of:
0.048194602 = sum of:
0.048194602 = product of:
0.0722919 = sum of:
0.047599096 = weight(_text_:c in 1714) [ClassicSimilarity], result of:
0.047599096 = score(doc=1714,freq=2.0), product of:
0.15612034 = queryWeight, product of:
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.045260075 = queryNorm
0.3048872 = fieldWeight in 1714, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.0625 = fieldNorm(doc=1714)
0.02469281 = weight(_text_:h in 1714) [ClassicSimilarity], result of:
0.02469281 = score(doc=1714,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.21959636 = fieldWeight in 1714, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0625 = fieldNorm(doc=1714)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Source
- Dialog mit Bibliotheken. 23(2011) H.1, S.31-36
-
Renz, M.: Automatische Inhaltserschließung im Zeichen von Wissensmanagement (2001)
0.02
0.021510338 = product of:
0.043020677 = sum of:
0.043020677 = product of:
0.06453101 = sum of:
0.021606207 = weight(_text_:h in 5671) [ClassicSimilarity], result of:
0.021606207 = score(doc=5671,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.19214681 = fieldWeight in 5671, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0546875 = fieldNorm(doc=5671)
0.042924806 = weight(_text_:22 in 5671) [ClassicSimilarity], result of:
0.042924806 = score(doc=5671,freq=2.0), product of:
0.15849307 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.045260075 = queryNorm
0.2708308 = fieldWeight in 5671, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=5671)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Date
- 22. 3.2001 13:14:48
- Source
- nfd Information - Wissenschaft und Praxis. 52(2001) H.2, S.69-78
-
Kasprzik, A.: Voraussetzungen und Anwendungspotentiale einer präzisen Sacherschließung aus Sicht der Wissenschaft (2018)
0.02
0.021510338 = product of:
0.043020677 = sum of:
0.043020677 = product of:
0.06453101 = sum of:
0.021606207 = weight(_text_:h in 5195) [ClassicSimilarity], result of:
0.021606207 = score(doc=5195,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.19214681 = fieldWeight in 5195, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0546875 = fieldNorm(doc=5195)
0.042924806 = weight(_text_:22 in 5195) [ClassicSimilarity], result of:
0.042924806 = score(doc=5195,freq=2.0), product of:
0.15849307 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.045260075 = queryNorm
0.2708308 = fieldWeight in 5195, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=5195)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Abstract
- Große Aufmerksamkeit richtet sich im Moment auf das Potential von automatisierten Methoden in der Sacherschließung und deren Interaktionsmöglichkeiten mit intellektuellen Methoden. In diesem Kontext befasst sich der vorliegende Beitrag mit den folgenden Fragen: Was sind die Anforderungen an bibliothekarische Metadaten aus Sicht der Wissenschaft? Was wird gebraucht, um den Informationsbedarf der Fachcommunities zu bedienen? Und was bedeutet das entsprechend für die Automatisierung der Metadatenerstellung und -pflege? Dieser Beitrag fasst die von der Autorin eingenommene Position in einem Impulsvortrag und der Podiumsdiskussion beim Workshop der FAG "Erschließung und Informationsvermittlung" des GBV zusammen. Der Workshop fand im Rahmen der 22. Verbundkonferenz des GBV statt.
- Source
- ABI-Technik. 38(2018) H.4, S.332-335
-
Franke-Maier, M.: Anforderungen an die Qualität der Inhaltserschließung im Spannungsfeld von intellektuell und automatisch erzeugten Metadaten (2018)
0.02
0.021510338 = product of:
0.043020677 = sum of:
0.043020677 = product of:
0.06453101 = sum of:
0.021606207 = weight(_text_:h in 5344) [ClassicSimilarity], result of:
0.021606207 = score(doc=5344,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.19214681 = fieldWeight in 5344, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0546875 = fieldNorm(doc=5344)
0.042924806 = weight(_text_:22 in 5344) [ClassicSimilarity], result of:
0.042924806 = score(doc=5344,freq=2.0), product of:
0.15849307 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.045260075 = queryNorm
0.2708308 = fieldWeight in 5344, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=5344)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Abstract
- Spätestens seit dem Deutschen Bibliothekartag 2018 hat sich die Diskussion zu den automatischen Verfahren der Inhaltserschließung der Deutschen Nationalbibliothek von einer politisch geführten Diskussion in eine Qualitätsdiskussion verwandelt. Der folgende Beitrag beschäftigt sich mit Fragen der Qualität von Inhaltserschließung in digitalen Zeiten, wo heterogene Erzeugnisse unterschiedlicher Verfahren aufeinandertreffen und versucht, wichtige Anforderungen an Qualität zu definieren. Dieser Tagungsbeitrag fasst die vom Autor als Impulse vorgetragenen Ideen beim Workshop der FAG "Erschließung und Informationsvermittlung" des GBV am 29. August 2018 in Kiel zusammen. Der Workshop fand im Rahmen der 22. Verbundkonferenz des GBV statt.
- Source
- ABI-Technik. 38(2018) H.4, S.327-331
-
Plaunt, C.; Norgard, B.A.: ¬An association-based method for automatic indexing with a controlled vocabulary (1998)
0.02
0.020136671 = product of:
0.040273342 = sum of:
0.040273342 = product of:
0.06041001 = sum of:
0.029749434 = weight(_text_:c in 1794) [ClassicSimilarity], result of:
0.029749434 = score(doc=1794,freq=2.0), product of:
0.15612034 = queryWeight, product of:
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.045260075 = queryNorm
0.1905545 = fieldWeight in 1794, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.0390625 = fieldNorm(doc=1794)
0.030660577 = weight(_text_:22 in 1794) [ClassicSimilarity], result of:
0.030660577 = score(doc=1794,freq=2.0), product of:
0.15849307 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.045260075 = queryNorm
0.19345059 = fieldWeight in 1794, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0390625 = fieldNorm(doc=1794)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Date
- 11. 9.2000 19:53:22
-
Busch, D.: Domänenspezifische hybride automatische Indexierung von bibliographischen Metadaten (2019)
0.02
0.018437434 = product of:
0.036874868 = sum of:
0.036874868 = product of:
0.0553123 = sum of:
0.018519606 = weight(_text_:h in 5628) [ClassicSimilarity], result of:
0.018519606 = score(doc=5628,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.16469726 = fieldWeight in 5628, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.046875 = fieldNorm(doc=5628)
0.03679269 = weight(_text_:22 in 5628) [ClassicSimilarity], result of:
0.03679269 = score(doc=5628,freq=2.0), product of:
0.15849307 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.045260075 = queryNorm
0.23214069 = fieldWeight in 5628, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=5628)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Source
- B.I.T.online. 22(2019) H.6, S.465-469
-
Leung, C.-H.; Kan, W.-K.: ¬A statistical learning approach to automatic indexing of controlled index terms (1997)
0.02
0.018072978 = product of:
0.036145955 = sum of:
0.036145955 = product of:
0.05421893 = sum of:
0.035699323 = weight(_text_:c in 6497) [ClassicSimilarity], result of:
0.035699323 = score(doc=6497,freq=2.0), product of:
0.15612034 = queryWeight, product of:
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.045260075 = queryNorm
0.22866541 = fieldWeight in 6497, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.046875 = fieldNorm(doc=6497)
0.018519606 = weight(_text_:h in 6497) [ClassicSimilarity], result of:
0.018519606 = score(doc=6497,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.16469726 = fieldWeight in 6497, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.046875 = fieldNorm(doc=6497)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
-
Ladewig, C.; Henkes, M.: Verfahren zur automatischen inhaltlichen Erschließung von elektronischen Texten : ASPECTIX (2001)
0.02
0.018072978 = product of:
0.036145955 = sum of:
0.036145955 = product of:
0.05421893 = sum of:
0.035699323 = weight(_text_:c in 5794) [ClassicSimilarity], result of:
0.035699323 = score(doc=5794,freq=2.0), product of:
0.15612034 = queryWeight, product of:
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.045260075 = queryNorm
0.22866541 = fieldWeight in 5794, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.046875 = fieldNorm(doc=5794)
0.018519606 = weight(_text_:h in 5794) [ClassicSimilarity], result of:
0.018519606 = score(doc=5794,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.16469726 = fieldWeight in 5794, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.046875 = fieldNorm(doc=5794)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Source
- nfd Information - Wissenschaft und Praxis. 52(2001) H.3, S.159-164
-
Cui, H.; Boufford, D.; Selden, P.: Semantic annotation of biosystematics literature without training examples (2010)
0.02
0.018072978 = product of:
0.036145955 = sum of:
0.036145955 = product of:
0.05421893 = sum of:
0.035699323 = weight(_text_:c in 3422) [ClassicSimilarity], result of:
0.035699323 = score(doc=3422,freq=2.0), product of:
0.15612034 = queryWeight, product of:
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.045260075 = queryNorm
0.22866541 = fieldWeight in 3422, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.046875 = fieldNorm(doc=3422)
0.018519606 = weight(_text_:h in 3422) [ClassicSimilarity], result of:
0.018519606 = score(doc=3422,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.16469726 = fieldWeight in 3422, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.046875 = fieldNorm(doc=3422)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Abstract
- This article presents an unsupervised algorithm for semantic annotation of morphological descriptions of whole organisms. The algorithm is able to annotate plain text descriptions with high accuracy at the clause level by exploiting the corpus itself. In other words, the algorithm does not need lexicons, syntactic parsers, training examples, or annotation templates. The evaluation on two real-life description collections in botany and paleontology shows that the algorithm has the following desirable features: (a) reduces/eliminates manual labor required to compile dictionaries and prepare source documents; (b) improves annotation coverage: the algorithm annotates what appears in documents and is not limited by predefined and often incomplete templates; (c) learns clean and reusable concepts: the algorithm learns organ names and character states that can be used to construct reusable domain lexicons, as opposed to collection-dependent patterns whose applicability is often limited to a particular collection; (d) insensitive to collection size; and (e) runs in linear time with respect to the number of clauses to be annotated.
-
Franke-Maier, M.; Beck, C.; Kasprzik, A.; Maas, J.F.; Pielmeier, S.; Wiesenmüller, H: ¬Ein Feuerwerk an Algorithmen und der Startschuss zur Bildung eines Kompetenznetzwerks für maschinelle Erschließung : Bericht zur Fachtagung Netzwerk maschinelle Erschließung an der Deutschen Nationalbibliothek am 10. und 11. Oktober 2019 (2020)
0.02
0.018072978 = product of:
0.036145955 = sum of:
0.036145955 = product of:
0.05421893 = sum of:
0.035699323 = weight(_text_:c in 5851) [ClassicSimilarity], result of:
0.035699323 = score(doc=5851,freq=2.0), product of:
0.15612034 = queryWeight, product of:
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.045260075 = queryNorm
0.22866541 = fieldWeight in 5851, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.046875 = fieldNorm(doc=5851)
0.018519606 = weight(_text_:h in 5851) [ClassicSimilarity], result of:
0.018519606 = score(doc=5851,freq=2.0), product of:
0.11244635 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045260075 = queryNorm
0.16469726 = fieldWeight in 5851, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.046875 = fieldNorm(doc=5851)
0.6666667 = coord(2/3)
0.5 = coord(1/2)