-
Lepsky, K.; Vorhauer, J.: Lingo - ein open source System für die Automatische Indexierung deutschsprachiger Dokumente (2006)
0.03
0.026138585 = product of:
0.057504885 = sum of:
0.007251961 = product of:
0.014503922 = sum of:
0.014503922 = weight(_text_:h in 3581) [ClassicSimilarity], result of:
0.014503922 = score(doc=3581,freq=2.0), product of:
0.0660481 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.026584605 = queryNorm
0.21959636 = fieldWeight in 3581, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0625 = fieldNorm(doc=3581)
0.5 = coord(1/2)
0.0031240587 = weight(_text_:a in 3581) [ClassicSimilarity], result of:
0.0031240587 = score(doc=3581,freq=2.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.10191591 = fieldWeight in 3581, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.0625 = fieldNorm(doc=3581)
0.0027776284 = weight(_text_:s in 3581) [ClassicSimilarity], result of:
0.0027776284 = score(doc=3581,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.09609913 = fieldWeight in 3581, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0625 = fieldNorm(doc=3581)
0.02994385 = weight(_text_:k in 3581) [ClassicSimilarity], result of:
0.02994385 = score(doc=3581,freq=2.0), product of:
0.09490114 = queryWeight, product of:
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.026584605 = queryNorm
0.31552678 = fieldWeight in 3581, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.0625 = fieldNorm(doc=3581)
0.014407388 = product of:
0.028814776 = sum of:
0.028814776 = weight(_text_:22 in 3581) [ClassicSimilarity], result of:
0.028814776 = score(doc=3581,freq=2.0), product of:
0.09309476 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.026584605 = queryNorm
0.30952093 = fieldWeight in 3581, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0625 = fieldNorm(doc=3581)
0.5 = coord(1/2)
0.45454547 = coord(5/11)
- Date
- 24. 3.2006 12:22:02
- Source
- ABI-Technik. 26(2006) H.1, S.18-28
- Type
- a
-
Hauer, M.: Automatische Indexierung (2000)
0.03
0.02512222 = product of:
0.069086105 = sum of:
0.0046860883 = weight(_text_:a in 5887) [ClassicSimilarity], result of:
0.0046860883 = score(doc=5887,freq=2.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.15287387 = fieldWeight in 5887, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.09375 = fieldNorm(doc=5887)
0.03862249 = weight(_text_:r in 5887) [ClassicSimilarity], result of:
0.03862249 = score(doc=5887,freq=2.0), product of:
0.088001914 = queryWeight, product of:
3.3102584 = idf(docFreq=4387, maxDocs=44218)
0.026584605 = queryNorm
0.4388824 = fieldWeight in 5887, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3102584 = idf(docFreq=4387, maxDocs=44218)
0.09375 = fieldNorm(doc=5887)
0.0041664424 = weight(_text_:s in 5887) [ClassicSimilarity], result of:
0.0041664424 = score(doc=5887,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.14414869 = fieldWeight in 5887, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.09375 = fieldNorm(doc=5887)
0.021611081 = product of:
0.043222163 = sum of:
0.043222163 = weight(_text_:22 in 5887) [ClassicSimilarity], result of:
0.043222163 = score(doc=5887,freq=2.0), product of:
0.09309476 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.026584605 = queryNorm
0.46428138 = fieldWeight in 5887, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.09375 = fieldNorm(doc=5887)
0.5 = coord(1/2)
0.36363637 = coord(4/11)
- Pages
- S.203-212
- Source
- Wissen in Aktion: Wege des Knowledge Managements. 22. Online-Tagung der DGI, Frankfurt am Main, 2.-4.5.2000. Proceedings. Hrsg.: R. Schmidt
- Type
- a
-
Rädler, K.: In Bibliothekskatalogen "googlen" : Integration von Inhaltsverzeichnissen, Volltexten und WEB-Ressourcen in Bibliothekskataloge (2004)
0.02
0.019768672 = product of:
0.043491077 = sum of:
0.004532476 = product of:
0.009064952 = sum of:
0.009064952 = weight(_text_:h in 2432) [ClassicSimilarity], result of:
0.009064952 = score(doc=2432,freq=2.0), product of:
0.0660481 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.026584605 = queryNorm
0.13724773 = fieldWeight in 2432, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0390625 = fieldNorm(doc=2432)
0.5 = coord(1/2)
0.0027613041 = weight(_text_:a in 2432) [ClassicSimilarity], result of:
0.0027613041 = score(doc=2432,freq=4.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.090081796 = fieldWeight in 2432, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.0390625 = fieldNorm(doc=2432)
0.0017360178 = weight(_text_:s in 2432) [ClassicSimilarity], result of:
0.0017360178 = score(doc=2432,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.060061958 = fieldWeight in 2432, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0390625 = fieldNorm(doc=2432)
0.015746372 = weight(_text_:u in 2432) [ClassicSimilarity], result of:
0.015746372 = score(doc=2432,freq=2.0), product of:
0.08704981 = queryWeight, product of:
3.2744443 = idf(docFreq=4547, maxDocs=44218)
0.026584605 = queryNorm
0.1808892 = fieldWeight in 2432, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.2744443 = idf(docFreq=4547, maxDocs=44218)
0.0390625 = fieldNorm(doc=2432)
0.018714907 = weight(_text_:k in 2432) [ClassicSimilarity], result of:
0.018714907 = score(doc=2432,freq=2.0), product of:
0.09490114 = queryWeight, product of:
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.026584605 = queryNorm
0.19720423 = fieldWeight in 2432, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.0390625 = fieldNorm(doc=2432)
0.45454547 = coord(5/11)
- Location
- A
- Source
- Bibliotheksdienst. 38(2004) H.7/8, S.927-939
- Theme
- Semantisches Umfeld in Indexierung u. Retrieval
- Type
- a
-
Nohr, H.: Theorie des Information Retrieval II : Automatische Indexierung (2004)
0.02
0.01820914 = product of:
0.040060107 = sum of:
0.004532476 = product of:
0.009064952 = sum of:
0.009064952 = weight(_text_:h in 8) [ClassicSimilarity], result of:
0.009064952 = score(doc=8,freq=2.0), product of:
0.0660481 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.026584605 = queryNorm
0.13724773 = fieldWeight in 8, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0390625 = fieldNorm(doc=8)
0.5 = coord(1/2)
0.0019525366 = weight(_text_:a in 8) [ClassicSimilarity], result of:
0.0019525366 = score(doc=8,freq=2.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.06369744 = fieldWeight in 8, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.0390625 = fieldNorm(doc=8)
0.016092705 = weight(_text_:r in 8) [ClassicSimilarity], result of:
0.016092705 = score(doc=8,freq=2.0), product of:
0.088001914 = queryWeight, product of:
3.3102584 = idf(docFreq=4387, maxDocs=44218)
0.026584605 = queryNorm
0.18286766 = fieldWeight in 8, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3102584 = idf(docFreq=4387, maxDocs=44218)
0.0390625 = fieldNorm(doc=8)
0.0017360178 = weight(_text_:s in 8) [ClassicSimilarity], result of:
0.0017360178 = score(doc=8,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.060061958 = fieldWeight in 8, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0390625 = fieldNorm(doc=8)
0.015746372 = weight(_text_:u in 8) [ClassicSimilarity], result of:
0.015746372 = score(doc=8,freq=2.0), product of:
0.08704981 = queryWeight, product of:
3.2744443 = idf(docFreq=4547, maxDocs=44218)
0.026584605 = queryNorm
0.1808892 = fieldWeight in 8, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.2744443 = idf(docFreq=4547, maxDocs=44218)
0.0390625 = fieldNorm(doc=8)
0.45454547 = coord(5/11)
- Pages
- S.215-225
- Source
- Grundlagen der praktischen Information und Dokumentation. 5., völlig neu gefaßte Ausgabe. 2 Bde. Hrsg. von R. Kuhlen, Th. Seeger u. D. Strauch. Begründet von Klaus Laisiepen, Ernst Lutterbeck, Karl-Heinrich Meyer-Uhlenried. Bd.1: Handbuch zur Einführung in die Informationswissenschaft und -praxis
- Type
- a
-
Goller, C.; Löning, J.; Will, T.; Wolff, W.: Automatic document classification : a thourough evaluation of various methods (2000)
0.02
0.016556118 = product of:
0.04552932 = sum of:
0.0052392064 = weight(_text_:a in 5480) [ClassicSimilarity], result of:
0.0052392064 = score(doc=5480,freq=10.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.1709182 = fieldWeight in 5480, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.046875 = fieldNorm(doc=5480)
0.019311246 = weight(_text_:r in 5480) [ClassicSimilarity], result of:
0.019311246 = score(doc=5480,freq=2.0), product of:
0.088001914 = queryWeight, product of:
3.3102584 = idf(docFreq=4387, maxDocs=44218)
0.026584605 = queryNorm
0.2194412 = fieldWeight in 5480, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3102584 = idf(docFreq=4387, maxDocs=44218)
0.046875 = fieldNorm(doc=5480)
0.0020832212 = weight(_text_:s in 5480) [ClassicSimilarity], result of:
0.0020832212 = score(doc=5480,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.072074346 = fieldWeight in 5480, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.046875 = fieldNorm(doc=5480)
0.018895645 = weight(_text_:u in 5480) [ClassicSimilarity], result of:
0.018895645 = score(doc=5480,freq=2.0), product of:
0.08704981 = queryWeight, product of:
3.2744443 = idf(docFreq=4547, maxDocs=44218)
0.026584605 = queryNorm
0.21706703 = fieldWeight in 5480, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.2744443 = idf(docFreq=4547, maxDocs=44218)
0.046875 = fieldNorm(doc=5480)
0.36363637 = coord(4/11)
- Abstract
- (Automatic) document classification is generally defined as content-based assignment of one or more predefined categories to documents. Usually, machine learning, statistical pattern recognition, or neural network approaches are used to construct classifiers automatically. In this paper we thoroughly evaluate a wide variety of these methods on a document classification task for German text. We evaluate different feature construction and selection methods and various classifiers. Our main results are: (1) feature selection is necessary not only to reduce learning and classification time, but also to avoid overfitting (even for Support Vector Machines); (2) surprisingly, our morphological analysis does not improve classification quality compared to a letter 5-gram approach; (3) Support Vector Machines are significantly better than all other classification methods
- Pages
- S.245-264
- Source
- Informationskompetenz - Basiskompetenz in der Informationsgesellschaft: Proceedings des 7. Internationalen Symposiums für Informationswissenschaft (ISI 2000), Hrsg.: G. Knorz u. R. Kuhlen
- Type
- a
-
Lepsky, K.; Zimmermann, H.H.: Katalogerweiterung durch Scanning und automatische Dokumenterschließung : Ergebnisse des DFG-Projekts KASCADE (2000)
0.02
0.015671818 = product of:
0.043097496 = sum of:
0.007251961 = product of:
0.014503922 = sum of:
0.014503922 = weight(_text_:h in 4966) [ClassicSimilarity], result of:
0.014503922 = score(doc=4966,freq=2.0), product of:
0.0660481 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.026584605 = queryNorm
0.21959636 = fieldWeight in 4966, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0625 = fieldNorm(doc=4966)
0.5 = coord(1/2)
0.0031240587 = weight(_text_:a in 4966) [ClassicSimilarity], result of:
0.0031240587 = score(doc=4966,freq=2.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.10191591 = fieldWeight in 4966, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.0625 = fieldNorm(doc=4966)
0.0027776284 = weight(_text_:s in 4966) [ClassicSimilarity], result of:
0.0027776284 = score(doc=4966,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.09609913 = fieldWeight in 4966, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0625 = fieldNorm(doc=4966)
0.02994385 = weight(_text_:k in 4966) [ClassicSimilarity], result of:
0.02994385 = score(doc=4966,freq=2.0), product of:
0.09490114 = queryWeight, product of:
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.026584605 = queryNorm
0.31552678 = fieldWeight in 4966, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.0625 = fieldNorm(doc=4966)
0.36363637 = coord(4/11)
- Source
- Zeitschrift für Bibliothekswesen und Bibliographie. 47(2000) H.4, S.305-316
- Type
- a
-
Rapke, K.: Automatische Indexierung von Volltexten für die Gruner+Jahr Pressedatenbank (2001)
0.01
0.013998606 = product of:
0.038496166 = sum of:
0.0019525366 = weight(_text_:a in 5863) [ClassicSimilarity], result of:
0.0019525366 = score(doc=5863,freq=2.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.06369744 = fieldWeight in 5863, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.0390625 = fieldNorm(doc=5863)
0.016092705 = weight(_text_:r in 5863) [ClassicSimilarity], result of:
0.016092705 = score(doc=5863,freq=2.0), product of:
0.088001914 = queryWeight, product of:
3.3102584 = idf(docFreq=4387, maxDocs=44218)
0.026584605 = queryNorm
0.18286766 = fieldWeight in 5863, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3102584 = idf(docFreq=4387, maxDocs=44218)
0.0390625 = fieldNorm(doc=5863)
0.0017360178 = weight(_text_:s in 5863) [ClassicSimilarity], result of:
0.0017360178 = score(doc=5863,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.060061958 = fieldWeight in 5863, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0390625 = fieldNorm(doc=5863)
0.018714907 = weight(_text_:k in 5863) [ClassicSimilarity], result of:
0.018714907 = score(doc=5863,freq=2.0), product of:
0.09490114 = queryWeight, product of:
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.026584605 = queryNorm
0.19720423 = fieldWeight in 5863, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.0390625 = fieldNorm(doc=5863)
0.36363637 = coord(4/11)
- Pages
- S.321-342
- Source
- Information Research & Content Management: Orientierung, Ordnung und Organisation im Wissensmarkt; 23. DGI-Online-Tagung der DGI und 53. Jahrestagung der Deutschen Gesellschaft für Informationswissenschaft und Informationspraxis e.V. DGI, Frankfurt am Main, 8.-10.5.2001. Proceedings. Hrsg.: R. Schmidt
- Type
- a
-
Niggemann, E.: Wer suchet, der findet? : Verbesserung der inhaltlichen Suchmöglichkeiten im Informationssystem Der Deutschen Bibliothek (2006)
0.01
0.012613323 = product of:
0.034686636 = sum of:
0.006345466 = product of:
0.012690932 = sum of:
0.012690932 = weight(_text_:h in 5812) [ClassicSimilarity], result of:
0.012690932 = score(doc=5812,freq=2.0), product of:
0.0660481 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.026584605 = queryNorm
0.19214681 = fieldWeight in 5812, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0546875 = fieldNorm(doc=5812)
0.5 = coord(1/2)
0.003865826 = weight(_text_:a in 5812) [ClassicSimilarity], result of:
0.003865826 = score(doc=5812,freq=4.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.12611452 = fieldWeight in 5812, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.0546875 = fieldNorm(doc=5812)
0.0024304248 = weight(_text_:s in 5812) [ClassicSimilarity], result of:
0.0024304248 = score(doc=5812,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.08408674 = fieldWeight in 5812, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0546875 = fieldNorm(doc=5812)
0.02204492 = weight(_text_:u in 5812) [ClassicSimilarity], result of:
0.02204492 = score(doc=5812,freq=2.0), product of:
0.08704981 = queryWeight, product of:
3.2744443 = idf(docFreq=4547, maxDocs=44218)
0.026584605 = queryNorm
0.25324488 = fieldWeight in 5812, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.2744443 = idf(docFreq=4547, maxDocs=44218)
0.0546875 = fieldNorm(doc=5812)
0.36363637 = coord(4/11)
- Abstract
- Elektronische Bibliothekskataloge und Bibliografien haben ihr Monopol bei der Suche nach Büchern, Aufsätzen, musikalischen Werken u. a. verloren. Globale Suchmaschinen sind starke Konkurrenten, und Bibliotheken müssen heute so planen, dass ihre Dienstleistungen auch morgen noch interessant sind. Die Deutsche Bibliothek (DDB) wird ihre traditionelle Katalogrecherche zu einem globalen, netzbasierten Informationssystem erweitern, das die Vorteile der neutralen, qualitätsbasierten Katalogsuche mit den Vorteilen moderner Suchmaschinen zu verbinden sucht. Dieser Beitrag beschäftigt sich mit der Verbesserung der inhaltlichen Suchmöglichkeiten im Informationssystem Der Deutschen Bibliothek. Weitere Entwicklungsstränge sollen nur kurz im Ausblick angerissen werden.
- Pages
- S.107-118
- Source
- Information und Sprache: Beiträge zu Informationswissenschaft, Computerlinguistik, Bibliothekswesen und verwandten Fächern. Festschrift für Harald H. Zimmermann. Herausgegeben von Ilse Harms, Heinz-Dirk Luckhardt und Hans W. Giessen
- Type
- a
-
Rapke, K.: Automatische Indexierung von Volltexten für die Gruner+Jahr Pressedatenbank (2001)
0.01
0.011753863 = product of:
0.032323122 = sum of:
0.0054389704 = product of:
0.010877941 = sum of:
0.010877941 = weight(_text_:h in 6386) [ClassicSimilarity], result of:
0.010877941 = score(doc=6386,freq=2.0), product of:
0.0660481 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.026584605 = queryNorm
0.16469726 = fieldWeight in 6386, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.046875 = fieldNorm(doc=6386)
0.5 = coord(1/2)
0.0023430442 = weight(_text_:a in 6386) [ClassicSimilarity], result of:
0.0023430442 = score(doc=6386,freq=2.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.07643694 = fieldWeight in 6386, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.046875 = fieldNorm(doc=6386)
0.0020832212 = weight(_text_:s in 6386) [ClassicSimilarity], result of:
0.0020832212 = score(doc=6386,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.072074346 = fieldWeight in 6386, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.046875 = fieldNorm(doc=6386)
0.022457888 = weight(_text_:k in 6386) [ClassicSimilarity], result of:
0.022457888 = score(doc=6386,freq=2.0), product of:
0.09490114 = queryWeight, product of:
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.026584605 = queryNorm
0.23664509 = fieldWeight in 6386, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.046875 = fieldNorm(doc=6386)
0.36363637 = coord(4/11)
- Source
- nfd Information - Wissenschaft und Praxis. 52(2001) H.5, S.251-262
- Type
- a
-
Nohr, H.: Grundlagen der automatischen Indexierung : ein Lehrbuch (2003)
0.01
0.010194249 = product of:
0.028034186 = sum of:
0.005127911 = product of:
0.010255822 = sum of:
0.010255822 = weight(_text_:h in 1767) [ClassicSimilarity], result of:
0.010255822 = score(doc=1767,freq=4.0), product of:
0.0660481 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.026584605 = queryNorm
0.15527807 = fieldWeight in 1767, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.03125 = fieldNorm(doc=1767)
0.5 = coord(1/2)
0.0031054833 = weight(_text_:s in 1767) [ClassicSimilarity], result of:
0.0031054833 = score(doc=1767,freq=10.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.1074421 = fieldWeight in 1767, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.03125 = fieldNorm(doc=1767)
0.012597097 = weight(_text_:u in 1767) [ClassicSimilarity], result of:
0.012597097 = score(doc=1767,freq=2.0), product of:
0.08704981 = queryWeight, product of:
3.2744443 = idf(docFreq=4547, maxDocs=44218)
0.026584605 = queryNorm
0.14471136 = fieldWeight in 1767, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.2744443 = idf(docFreq=4547, maxDocs=44218)
0.03125 = fieldNorm(doc=1767)
0.007203694 = product of:
0.014407388 = sum of:
0.014407388 = weight(_text_:22 in 1767) [ClassicSimilarity], result of:
0.014407388 = score(doc=1767,freq=2.0), product of:
0.09309476 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.026584605 = queryNorm
0.15476047 = fieldWeight in 1767, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.03125 = fieldNorm(doc=1767)
0.5 = coord(1/2)
0.36363637 = coord(4/11)
- Date
- 22. 6.2009 12:46:51
- Footnote
- Rez. in: nfd 54(2003) H.5, S.314 (W. Ratzek): "Um entscheidungsrelevante Daten aus der ständig wachsenden Flut von mehr oder weniger relevanten Dokumenten zu extrahieren, müssen Unternehmen, öffentliche Verwaltung oder Einrichtungen der Fachinformation effektive und effiziente Filtersysteme entwickeln, einsetzen und pflegen. Das vorliegende Lehrbuch von Holger Nohr bietet erstmalig eine grundlegende Einführung in das Thema "automatische Indexierung". Denn: "Wie man Information sammelt, verwaltet und verwendet, wird darüber entscheiden, ob man zu den Gewinnern oder Verlierern gehört" (Bill Gates), heißt es einleitend. Im ersten Kapitel "Einleitung" stehen die Grundlagen im Mittelpunkt. Die Zusammenhänge zwischen Dokumenten-Management-Systeme, Information Retrieval und Indexierung für Planungs-, Entscheidungs- oder Innovationsprozesse, sowohl in Profit- als auch Non-Profit-Organisationen werden beschrieben. Am Ende des einleitenden Kapitels geht Nohr auf die Diskussion um die intellektuelle und automatische Indexierung ein und leitet damit über zum zweiten Kapitel "automatisches Indexieren. Hier geht der Autor überblickartig unter anderem ein auf - Probleme der automatischen Sprachverarbeitung und Indexierung - verschiedene Verfahren der automatischen Indexierung z.B. einfache Stichwortextraktion / Volltextinvertierung, - statistische Verfahren, Pattern-Matching-Verfahren. Die "Verfahren der automatischen Indexierung" behandelt Nohr dann vertiefend und mit vielen Beispielen versehen im umfangreichsten dritten Kapitel. Das vierte Kapitel "Keyphrase Extraction" nimmt eine Passpartout-Status ein: "Eine Zwischenstufe auf dem Weg von der automatischen Indexierung hin zur automatischen Generierung textueller Zusammenfassungen (Automatic Text Summarization) stellen Ansätze dar, die Schlüsselphrasen aus Dokumenten extrahieren (Keyphrase Extraction). Die Grenzen zwischen den automatischen Verfahren der Indexierung und denen des Text Summarization sind fließend." (S. 91). Am Beispiel NCR"s Extractor/Copernic Summarizer beschreibt Nohr die Funktionsweise.
Im fünften Kapitel "Information Extraction" geht Nohr auf eine Problemstellung ein, die in der Fachwelt eine noch stärkere Betonung verdiente: "Die stetig ansteigende Zahl elektronischer Dokumente macht neben einer automatischen Erschließung auch eine automatische Gewinnung der relevanten Informationen aus diesen Dokumenten wünschenswert, um diese z.B. für weitere Bearbeitungen oder Auswertungen in betriebliche Informationssysteme übernehmen zu können." (S. 103) "Indexierung und Retrievalverfahren" als voneinander abhängige Verfahren werden im sechsten Kapitel behandelt. Hier stehen Relevance Ranking und Relevance Feedback sowie die Anwendung informationslinguistischer Verfahren in der Recherche im Mittelpunkt. Die "Evaluation automatischer Indexierung" setzt den thematischen Schlusspunkt. Hier geht es vor allem um die Oualität einer Indexierung, um gängige Retrievalmaße in Retrievaltest und deren Einssatz. Weiterhin ist hervorzuheben, dass jedes Kapitel durch die Vorgabe von Lernzielen eingeleitet wird und zu den jeweiligen Kapiteln (im hinteren Teil des Buches) einige Kontrollfragen gestellt werden. Die sehr zahlreichen Beispiele aus der Praxis, ein Abkürzungsverzeichnis und ein Sachregister erhöhen den Nutzwert des Buches. Die Lektüre förderte beim Rezensenten das Verständnis für die Zusammenhänge von BID-Handwerkzeug, Wirtschaftsinformatik (insbesondere Data Warehousing) und Künstlicher Intelligenz. Die "Grundlagen der automatischen Indexierung" sollte auch in den bibliothekarischen Studiengängen zur Pflichtlektüre gehören. Holger Nohrs Lehrbuch ist auch für den BID-Profi geeignet, um die mehr oder weniger fundierten Kenntnisse auf dem Gebiet "automatisches Indexieren" schnell, leicht verständlich und informativ aufzufrischen."
- Pages
- 153 S
- Theme
- Grundlagen u. Einführungen: Allgemeine Literatur
-
Probst, M.; Mittelbach, J.: Maschinelle Indexierung in der Sacherschließung wissenschaftlicher Bibliotheken (2006)
0.01
0.010022195 = product of:
0.027561035 = sum of:
0.007251961 = product of:
0.014503922 = sum of:
0.014503922 = weight(_text_:h in 1755) [ClassicSimilarity], result of:
0.014503922 = score(doc=1755,freq=2.0), product of:
0.0660481 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.026584605 = queryNorm
0.21959636 = fieldWeight in 1755, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0625 = fieldNorm(doc=1755)
0.5 = coord(1/2)
0.0031240587 = weight(_text_:a in 1755) [ClassicSimilarity], result of:
0.0031240587 = score(doc=1755,freq=2.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.10191591 = fieldWeight in 1755, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.0625 = fieldNorm(doc=1755)
0.0027776284 = weight(_text_:s in 1755) [ClassicSimilarity], result of:
0.0027776284 = score(doc=1755,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.09609913 = fieldWeight in 1755, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0625 = fieldNorm(doc=1755)
0.014407388 = product of:
0.028814776 = sum of:
0.028814776 = weight(_text_:22 in 1755) [ClassicSimilarity], result of:
0.028814776 = score(doc=1755,freq=2.0), product of:
0.09309476 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.026584605 = queryNorm
0.30952093 = fieldWeight in 1755, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0625 = fieldNorm(doc=1755)
0.5 = coord(1/2)
0.36363637 = coord(4/11)
- Date
- 22. 3.2008 12:35:19
- Source
- Bibliothek: Forschung und Praxis. 30(2006) H.2, S.168-176
- Type
- a
-
Hlava, M.M.K.: Automatic indexing : comparing rule-based and statistics-based indexing systems (2005)
0.01
0.009692967 = product of:
0.03554088 = sum of:
0.005467103 = weight(_text_:a in 6265) [ClassicSimilarity], result of:
0.005467103 = score(doc=6265,freq=2.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.17835285 = fieldWeight in 6265, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.109375 = fieldNorm(doc=6265)
0.0048608496 = weight(_text_:s in 6265) [ClassicSimilarity], result of:
0.0048608496 = score(doc=6265,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.16817348 = fieldWeight in 6265, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.109375 = fieldNorm(doc=6265)
0.025212929 = product of:
0.050425857 = sum of:
0.050425857 = weight(_text_:22 in 6265) [ClassicSimilarity], result of:
0.050425857 = score(doc=6265,freq=2.0), product of:
0.09309476 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.026584605 = queryNorm
0.5416616 = fieldWeight in 6265, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.109375 = fieldNorm(doc=6265)
0.5 = coord(1/2)
0.27272728 = coord(3/11)
- Source
- Information outlook. 9(2005) no.8, S.22-23
- Type
- a
-
Hauer, M: Silicon Valley Vorarlberg : Maschinelle Indexierung und semantisches Retrieval verbessert den Katalog der Vorarlberger Landesbibliothek (2004)
0.01
0.009009517 = product of:
0.02477617 = sum of:
0.004532476 = product of:
0.009064952 = sum of:
0.009064952 = weight(_text_:h in 2489) [ClassicSimilarity], result of:
0.009064952 = score(doc=2489,freq=2.0), product of:
0.0660481 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.026584605 = queryNorm
0.13724773 = fieldWeight in 2489, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0390625 = fieldNorm(doc=2489)
0.5 = coord(1/2)
0.0027613041 = weight(_text_:a in 2489) [ClassicSimilarity], result of:
0.0027613041 = score(doc=2489,freq=4.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.090081796 = fieldWeight in 2489, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.0390625 = fieldNorm(doc=2489)
0.0017360178 = weight(_text_:s in 2489) [ClassicSimilarity], result of:
0.0017360178 = score(doc=2489,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.060061958 = fieldWeight in 2489, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0390625 = fieldNorm(doc=2489)
0.015746372 = weight(_text_:u in 2489) [ClassicSimilarity], result of:
0.015746372 = score(doc=2489,freq=2.0), product of:
0.08704981 = queryWeight, product of:
3.2744443 = idf(docFreq=4547, maxDocs=44218)
0.026584605 = queryNorm
0.1808892 = fieldWeight in 2489, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.2744443 = idf(docFreq=4547, maxDocs=44218)
0.0390625 = fieldNorm(doc=2489)
0.36363637 = coord(4/11)
- Location
- A
- Source
- Mitteilungen der Vereinigung Österreichischer Bibliothekarinnen und Bibliothekare. 57(2004) H.3/4, S.33-38
- Theme
- Semantisches Umfeld in Indexierung u. Retrieval
- Type
- a
-
Renz, M.: Automatische Inhaltserschließung im Zeichen von Wissensmanagement (2001)
0.01
0.00876942 = product of:
0.024115905 = sum of:
0.006345466 = product of:
0.012690932 = sum of:
0.012690932 = weight(_text_:h in 5671) [ClassicSimilarity], result of:
0.012690932 = score(doc=5671,freq=2.0), product of:
0.0660481 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.026584605 = queryNorm
0.19214681 = fieldWeight in 5671, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0546875 = fieldNorm(doc=5671)
0.5 = coord(1/2)
0.0027335514 = weight(_text_:a in 5671) [ClassicSimilarity], result of:
0.0027335514 = score(doc=5671,freq=2.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.089176424 = fieldWeight in 5671, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.0546875 = fieldNorm(doc=5671)
0.0024304248 = weight(_text_:s in 5671) [ClassicSimilarity], result of:
0.0024304248 = score(doc=5671,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.08408674 = fieldWeight in 5671, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0546875 = fieldNorm(doc=5671)
0.012606464 = product of:
0.025212929 = sum of:
0.025212929 = weight(_text_:22 in 5671) [ClassicSimilarity], result of:
0.025212929 = score(doc=5671,freq=2.0), product of:
0.09309476 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.026584605 = queryNorm
0.2708308 = fieldWeight in 5671, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=5671)
0.5 = coord(1/2)
0.36363637 = coord(4/11)
- Date
- 22. 3.2001 13:14:48
- Source
- nfd Information - Wissenschaft und Praxis. 52(2001) H.2, S.69-78
- Type
- a
-
Oberhauser, O.; Labner, J.: OPAC-Erweiterung durch automatische Indexierung : Empirische Untersuchung mit Daten aus dem Österreichischen Verbundkatalog (2002)
0.01
0.007973309 = product of:
0.021926599 = sum of:
0.0054389704 = product of:
0.010877941 = sum of:
0.010877941 = weight(_text_:h in 883) [ClassicSimilarity], result of:
0.010877941 = score(doc=883,freq=2.0), product of:
0.0660481 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.026584605 = queryNorm
0.16469726 = fieldWeight in 883, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.046875 = fieldNorm(doc=883)
0.5 = coord(1/2)
0.0033135647 = weight(_text_:a in 883) [ClassicSimilarity], result of:
0.0033135647 = score(doc=883,freq=4.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.10809815 = fieldWeight in 883, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.046875 = fieldNorm(doc=883)
0.011090843 = product of:
0.044363372 = sum of:
0.044363372 = weight(_text_:o in 883) [ClassicSimilarity], result of:
0.044363372 = score(doc=883,freq=2.0), product of:
0.13338262 = queryWeight, product of:
5.017288 = idf(docFreq=795, maxDocs=44218)
0.026584605 = queryNorm
0.33260235 = fieldWeight in 883, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
5.017288 = idf(docFreq=795, maxDocs=44218)
0.046875 = fieldNorm(doc=883)
0.25 = coord(1/4)
0.0020832212 = weight(_text_:s in 883) [ClassicSimilarity], result of:
0.0020832212 = score(doc=883,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.072074346 = fieldWeight in 883, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.046875 = fieldNorm(doc=883)
0.36363637 = coord(4/11)
- Location
- A
- Source
- ABI-Technik. 23(2003) H.4, S.305-314
- Type
- a
-
Woltering, H.: Maschinelle Indexierung in der Bibliothek der Friedrich-Ebert-Stiftung (2002)
0.01
0.0077115386 = product of:
0.02827564 = sum of:
0.017947687 = product of:
0.035895374 = sum of:
0.035895374 = weight(_text_:h in 4351) [ClassicSimilarity], result of:
0.035895374 = score(doc=4351,freq=4.0), product of:
0.0660481 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.026584605 = queryNorm
0.54347324 = fieldWeight in 4351, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.109375 = fieldNorm(doc=4351)
0.5 = coord(1/2)
0.005467103 = weight(_text_:a in 4351) [ClassicSimilarity], result of:
0.005467103 = score(doc=4351,freq=2.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.17835285 = fieldWeight in 4351, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.109375 = fieldNorm(doc=4351)
0.0048608496 = weight(_text_:s in 4351) [ClassicSimilarity], result of:
0.0048608496 = score(doc=4351,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.16817348 = fieldWeight in 4351, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.109375 = fieldNorm(doc=4351)
0.27272728 = coord(3/11)
- Source
- ProLibris. 7(2002) H.3, S.160-161
- Type
- a
-
Nohr, H.: Automatische Indexierung : Einführung in betriebliche Verfahren, Systeme und Anwendungen (2001)
0.01
0.0071422625 = product of:
0.026188295 = sum of:
0.0036259806 = product of:
0.007251961 = sum of:
0.007251961 = weight(_text_:h in 2543) [ClassicSimilarity], result of:
0.007251961 = score(doc=2543,freq=2.0), product of:
0.0660481 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.026584605 = queryNorm
0.10979818 = fieldWeight in 2543, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.03125 = fieldNorm(doc=2543)
0.5 = coord(1/2)
0.0013888142 = weight(_text_:s in 2543) [ClassicSimilarity], result of:
0.0013888142 = score(doc=2543,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.048049565 = fieldWeight in 2543, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.03125 = fieldNorm(doc=2543)
0.0211735 = weight(_text_:k in 2543) [ClassicSimilarity], result of:
0.0211735 = score(doc=2543,freq=4.0), product of:
0.09490114 = queryWeight, product of:
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.026584605 = queryNorm
0.22311112 = fieldWeight in 2543, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.03125 = fieldNorm(doc=2543)
0.27272728 = coord(3/11)
- Classification
- BCAO (FH K)
- GHBS
- BCAO (FH K)
- Pages
- 108 S
-
Gaus, W.; Kaluscha, R.: Maschinelle inhaltliche Erschließung von Arztbriefen und Auswertung von Reha-Entlassungsberichten (2006)
0.01
0.007073087 = product of:
0.019450989 = sum of:
0.0036259806 = product of:
0.007251961 = sum of:
0.007251961 = weight(_text_:h in 6078) [ClassicSimilarity], result of:
0.007251961 = score(doc=6078,freq=2.0), product of:
0.0660481 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.026584605 = queryNorm
0.10979818 = fieldWeight in 6078, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.03125 = fieldNorm(doc=6078)
0.5 = coord(1/2)
0.0015620294 = weight(_text_:a in 6078) [ClassicSimilarity], result of:
0.0015620294 = score(doc=6078,freq=2.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.050957955 = fieldWeight in 6078, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.03125 = fieldNorm(doc=6078)
0.012874164 = weight(_text_:r in 6078) [ClassicSimilarity], result of:
0.012874164 = score(doc=6078,freq=2.0), product of:
0.088001914 = queryWeight, product of:
3.3102584 = idf(docFreq=4387, maxDocs=44218)
0.026584605 = queryNorm
0.14629413 = fieldWeight in 6078, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.3102584 = idf(docFreq=4387, maxDocs=44218)
0.03125 = fieldNorm(doc=6078)
0.0013888142 = weight(_text_:s in 6078) [ClassicSimilarity], result of:
0.0013888142 = score(doc=6078,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.048049565 = fieldWeight in 6078, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.03125 = fieldNorm(doc=6078)
0.36363637 = coord(4/11)
- Pages
- S.159-168
- Source
- Information und Sprache: Beiträge zu Informationswissenschaft, Computerlinguistik, Bibliothekswesen und verwandten Fächern. Festschrift für Harald H. Zimmermann. Herausgegeben von Ilse Harms, Heinz-Dirk Luckhardt und Hans W. Giessen
- Type
- a
-
Li, W.; Wong, K.-F.; Yuan, C.: Toward automatic Chinese temporal information extraction (2001)
0.01
0.006881903 = product of:
0.025233643 = sum of:
0.004782719 = weight(_text_:a in 6029) [ClassicSimilarity], result of:
0.004782719 = score(doc=6029,freq=12.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.15602624 = fieldWeight in 6029, product of:
3.4641016 = tf(freq=12.0), with freq of:
12.0 = termFreq=12.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.0390625 = fieldNorm(doc=6029)
0.0017360178 = weight(_text_:s in 6029) [ClassicSimilarity], result of:
0.0017360178 = score(doc=6029,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.060061958 = fieldWeight in 6029, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0390625 = fieldNorm(doc=6029)
0.018714907 = weight(_text_:k in 6029) [ClassicSimilarity], result of:
0.018714907 = score(doc=6029,freq=2.0), product of:
0.09490114 = queryWeight, product of:
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.026584605 = queryNorm
0.19720423 = fieldWeight in 6029, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.0390625 = fieldNorm(doc=6029)
0.27272728 = coord(3/11)
- Abstract
- Over the past few years, temporal information processing and temporal database management have increasingly become hot topics. Nevertheless, only a few researchers have investigated these areas in the Chinese language. This lays down the objective of our research: to exploit Chinese language processing techniques for temporal information extraction and concept reasoning. In this article, we first study the mechanism for expressing time in Chinese. On the basis of the study, we then design a general frame structure for maintaining the extracted temporal concepts and propose a system for extracting time-dependent information from Hong Kong financial news. In the system, temporal knowledge is represented by different types of temporal concepts (TTC) and different temporal relations, including absolute and relative relations, which are used to correlate between action times and reference times. In analyzing a sentence, the algorithm first determines the situation related to the verb. This in turn will identify the type of temporal concept associated with the verb. After that, the relevant temporal information is extracted and the temporal relations are derived. These relations link relevant concept frames together in chronological order, which in turn provide the knowledge to fulfill users' queries, e.g., for question-answering (i.e., Q&A) applications
- Source
- Journal of the American Society for Information Science and technology. 52(2001) no.9, S.748-762
- Type
- a
-
Lepsky, K.: Automatische Indexierung des Reallexikons zur Deutschen Kunstgeschichte (2006)
0.01
0.0068564205 = product of:
0.018855156 = sum of:
0.003172733 = product of:
0.006345466 = sum of:
0.006345466 = weight(_text_:h in 6080) [ClassicSimilarity], result of:
0.006345466 = score(doc=6080,freq=2.0), product of:
0.0660481 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.026584605 = queryNorm
0.096073404 = fieldWeight in 6080, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.02734375 = fieldNorm(doc=6080)
0.5 = coord(1/2)
0.0013667757 = weight(_text_:a in 6080) [ClassicSimilarity], result of:
0.0013667757 = score(doc=6080,freq=2.0), product of:
0.030653298 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.026584605 = queryNorm
0.044588212 = fieldWeight in 6080, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.02734375 = fieldNorm(doc=6080)
0.0012152124 = weight(_text_:s in 6080) [ClassicSimilarity], result of:
0.0012152124 = score(doc=6080,freq=2.0), product of:
0.028903782 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.026584605 = queryNorm
0.04204337 = fieldWeight in 6080, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.02734375 = fieldNorm(doc=6080)
0.013100435 = weight(_text_:k in 6080) [ClassicSimilarity], result of:
0.013100435 = score(doc=6080,freq=2.0), product of:
0.09490114 = queryWeight, product of:
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.026584605 = queryNorm
0.13804297 = fieldWeight in 6080, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.02734375 = fieldNorm(doc=6080)
0.36363637 = coord(4/11)
- Pages
- S.169-178
- Source
- Information und Sprache: Beiträge zu Informationswissenschaft, Computerlinguistik, Bibliothekswesen und verwandten Fächern. Festschrift für Harald H. Zimmermann. Herausgegeben von Ilse Harms, Heinz-Dirk Luckhardt und Hans W. Giessen
- Type
- a