-
Siebenkäs, A.; Markscheffel, B.: Conception of a workflow for the semi-automatic construction of a thesaurus for the German printing industry (2015)
0.07
0.0667823 = product of:
0.11448395 = sum of:
0.01438466 = product of:
0.04315398 = sum of:
0.04315398 = weight(_text_:f in 2091) [ClassicSimilarity], result of:
0.04315398 = score(doc=2091,freq=2.0), product of:
0.13999219 = queryWeight, product of:
3.985786 = idf(docFreq=2232, maxDocs=44218)
0.035122856 = queryNorm
0.3082599 = fieldWeight in 2091, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.985786 = idf(docFreq=2232, maxDocs=44218)
0.0546875 = fieldNorm(doc=2091)
0.33333334 = coord(1/3)
0.014499208 = weight(_text_:information in 2091) [ClassicSimilarity], result of:
0.014499208 = score(doc=2091,freq=6.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.23515764 = fieldWeight in 2091, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=2091)
0.013343699 = weight(_text_:und in 2091) [ClassicSimilarity], result of:
0.013343699 = score(doc=2091,freq=2.0), product of:
0.07784514 = queryWeight, product of:
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.035122856 = queryNorm
0.17141339 = fieldWeight in 2091, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.0546875 = fieldNorm(doc=2091)
0.019151485 = weight(_text_:for in 2091) [ClassicSimilarity], result of:
0.019151485 = score(doc=2091,freq=8.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.29041752 = fieldWeight in 2091, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.0546875 = fieldNorm(doc=2091)
0.01912591 = weight(_text_:the in 2091) [ClassicSimilarity], result of:
0.01912591 = score(doc=2091,freq=16.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.34513593 = fieldWeight in 2091, product of:
4.0 = tf(freq=16.0), with freq of:
16.0 = termFreq=16.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0546875 = fieldNorm(doc=2091)
0.0148530835 = weight(_text_:of in 2091) [ClassicSimilarity], result of:
0.0148530835 = score(doc=2091,freq=10.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.2704316 = fieldWeight in 2091, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.0546875 = fieldNorm(doc=2091)
0.01912591 = weight(_text_:the in 2091) [ClassicSimilarity], result of:
0.01912591 = score(doc=2091,freq=16.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.34513593 = fieldWeight in 2091, product of:
4.0 = tf(freq=16.0), with freq of:
16.0 = termFreq=16.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0546875 = fieldNorm(doc=2091)
0.5833333 = coord(7/12)
- Abstract
- During the BMWI granted project "Print-IT", the need of a thesaurus-based, uniform and consistent language for the German printing industry became evident. In this paper we introduce a semi-automatic construction approach for such a thesaurus and present a workflow which supports users in generating thesaurus-typical information structures from relevant digitalized resources with the help of common IT-tools.
- Source
- Re:inventing information science in the networked society: Proceedings of the 14th International Symposium on Information Science, Zadar/Croatia, 19th-21st May 2015. Eds.: F. Pehar, C. Schloegl u. C. Wolff
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
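The score breakdown for document 2091 can be reproduced by hand. The sketch below assumes the standard Lucene ClassicSimilarity formulas, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). It takes queryNorm, fieldNorm, the term statistics and the coord factors directly from the explain output. The function and variable names are illustrative and not part of any library. Lucene computes in 32-bit floats, so the results here agree only to about six significant digits.

```python
import math


def idf(doc_freq, max_docs):
    # ClassicSimilarity inverse document frequency
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as in the explain tree
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


# Values copied from the explain output for doc 2091
MAX_DOCS = 44218
QUERY_NORM = 0.035122856
FIELD_NORM = 0.0546875

# The term "f" sits in a nested clause with coord(1/3)
f_score = term_score(2.0, 2232, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# (termFreq, docFreq) for: information, und, for, the, of, the
other_terms = [
    (6.0, 20772),
    (2.0, 13101),
    (8.0, 18385),
    (16.0, 24812),
    (10.0, 25162),
    (16.0, 24812),
]

total = f_score * (1 / 3) + sum(
    term_score(freq, doc_freq, MAX_DOCS, QUERY_NORM, FIELD_NORM)
    for freq, doc_freq in other_terms
)

# Outer coord(7/12): 7 of the 12 query clauses matched
doc_score = total * (7 / 12)
print(f"{doc_score:.7f}")
```

The term "the" appears twice in the list because it appears twice in the explain tree, and both occurrences contribute to the sum.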
-
Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998)
0.06
0.06207646 = product of:
0.14898351 = sum of:
0.011958744 = weight(_text_:information in 4157) [ClassicSimilarity], result of:
0.011958744 = score(doc=4157,freq=2.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.19395474 = fieldWeight in 4157, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.078125 = fieldNorm(doc=4157)
0.019062428 = weight(_text_:und in 4157) [ClassicSimilarity], result of:
0.019062428 = score(doc=4157,freq=2.0), product of:
0.07784514 = queryWeight, product of:
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.035122856 = queryNorm
0.24487628 = fieldWeight in 4157, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.078125 = fieldNorm(doc=4157)
0.08467974 = weight(_text_:dokumentation in 4157) [ClassicSimilarity], result of:
0.08467974 = score(doc=4157,freq=2.0), product of:
0.16407113 = queryWeight, product of:
4.671349 = idf(docFreq=1124, maxDocs=44218)
0.035122856 = queryNorm
0.516116 = fieldWeight in 4157, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.671349 = idf(docFreq=1124, maxDocs=44218)
0.078125 = fieldNorm(doc=4157)
0.009489287 = weight(_text_:of in 4157) [ClassicSimilarity], result of:
0.009489287 = score(doc=4157,freq=2.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.17277241 = fieldWeight in 4157, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.078125 = fieldNorm(doc=4157)
0.023793312 = product of:
0.047586624 = sum of:
0.047586624 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
0.047586624 = score(doc=4157,freq=2.0), product of:
0.12299426 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.035122856 = queryNorm
0.38690117 = fieldWeight in 4157, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.078125 = fieldNorm(doc=4157)
0.5 = coord(1/2)
0.41666666 = coord(5/12)
- Source
- Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Hrsg. von Marlies Ockenfeld u. Gerhard J. Mantwill
-
Milstead, J.L.: Thesauri in a full-text world (1998)
0.06
0.05910142 = product of:
0.10131673 = sum of:
0.011958744 = weight(_text_:information in 2337) [ClassicSimilarity], result of:
0.011958744 = score(doc=2337,freq=8.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.19395474 = fieldWeight in 2337, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0390625 = fieldNorm(doc=2337)
0.009531214 = weight(_text_:und in 2337) [ClassicSimilarity], result of:
0.009531214 = score(doc=2337,freq=2.0), product of:
0.07784514 = queryWeight, product of:
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.035122856 = queryNorm
0.12243814 = fieldWeight in 2337, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.0390625 = fieldNorm(doc=2337)
0.018096453 = weight(_text_:for in 2337) [ClassicSimilarity], result of:
0.018096453 = score(doc=2337,freq=14.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.27441877 = fieldWeight in 2337, product of:
3.7416575 = tf(freq=14.0), with freq of:
14.0 = termFreq=14.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.0390625 = fieldNorm(doc=2337)
0.017414892 = weight(_text_:the in 2337) [ClassicSimilarity], result of:
0.017414892 = score(doc=2337,freq=26.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.3142598 = fieldWeight in 2337, product of:
5.0990195 = tf(freq=26.0), with freq of:
26.0 = termFreq=26.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0390625 = fieldNorm(doc=2337)
0.015003879 = weight(_text_:of in 2337) [ClassicSimilarity], result of:
0.015003879 = score(doc=2337,freq=20.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.27317715 = fieldWeight in 2337, product of:
4.472136 = tf(freq=20.0), with freq of:
20.0 = termFreq=20.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.0390625 = fieldNorm(doc=2337)
0.017414892 = weight(_text_:the in 2337) [ClassicSimilarity], result of:
0.017414892 = score(doc=2337,freq=26.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.3142598 = fieldWeight in 2337, product of:
5.0990195 = tf(freq=26.0), with freq of:
26.0 = termFreq=26.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0390625 = fieldNorm(doc=2337)
0.011896656 = product of:
0.023793312 = sum of:
0.023793312 = weight(_text_:22 in 2337) [ClassicSimilarity], result of:
0.023793312 = score(doc=2337,freq=2.0), product of:
0.12299426 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.035122856 = queryNorm
0.19345059 = fieldWeight in 2337, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0390625 = fieldNorm(doc=2337)
0.5 = coord(1/2)
0.5833333 = coord(7/12)
- Abstract
- Despite early claims to the contrary, thesauri continue to find use as access tools for information in the full-text environment. Their mode of use is changing, but this change actually represents an expansion rather than a contradiction of their utility. Thesauri and similar vocabulary tools can complement full-text access by aiding users in focusing their searches, by supplementing the linguistic analysis of the text search engine, and even by serving as one of the tools used by the linguistic engine for its analysis. While human indexing continues to be used for many databases, the trend is to increase the use of machine aids for this purpose. All machine-aided indexing (MAI) systems rely on thesauri as the basis for term selection. In the 21st century, the balance of effort between human and machine will change at both input and output, but thesauri will continue to play an important role for the foreseeable future
- Date
- 22. 9.1997 19:16:05
- Imprint
- Urbana-Champaign, IL : Illinois University at Urbana-Champaign, Graduate School of Library and Information Science
- Source
- Visualizing subject access for 21st century information resources: Papers presented at the 1997 Clinic on Library Applications of Data Processing, 2-4 Mar 1997, Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign. Ed.: P.A. Cochrane et al
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Can, F.: Incremental clustering for dynamic information processing (1993)
0.05
0.0549512 = product of:
0.1099024 = sum of:
0.016439613 = product of:
0.049318835 = sum of:
0.049318835 = weight(_text_:f in 6627) [ClassicSimilarity], result of:
0.049318835 = score(doc=6627,freq=2.0), product of:
0.13999219 = queryWeight, product of:
3.985786 = idf(docFreq=2232, maxDocs=44218)
0.035122856 = queryNorm
0.35229704 = fieldWeight in 6627, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.985786 = idf(docFreq=2232, maxDocs=44218)
0.0625 = fieldNorm(doc=6627)
0.33333334 = coord(1/3)
0.013529775 = weight(_text_:information in 6627) [ClassicSimilarity], result of:
0.013529775 = score(doc=6627,freq=4.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.21943474 = fieldWeight in 6627, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0625 = fieldNorm(doc=6627)
0.018955056 = weight(_text_:for in 6627) [ClassicSimilarity], result of:
0.018955056 = score(doc=6627,freq=6.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.28743884 = fieldWeight in 6627, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.0625 = fieldNorm(doc=6627)
0.020446459 = weight(_text_:the in 6627) [ClassicSimilarity], result of:
0.020446459 = score(doc=6627,freq=14.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.36896583 = fieldWeight in 6627, product of:
3.7416575 = tf(freq=14.0), with freq of:
14.0 = termFreq=14.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0625 = fieldNorm(doc=6627)
0.020085035 = weight(_text_:of in 6627) [ClassicSimilarity], result of:
0.020085035 = score(doc=6627,freq=14.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.36569026 = fieldWeight in 6627, product of:
3.7416575 = tf(freq=14.0), with freq of:
14.0 = termFreq=14.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.0625 = fieldNorm(doc=6627)
0.020446459 = weight(_text_:the in 6627) [ClassicSimilarity], result of:
0.020446459 = score(doc=6627,freq=14.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.36896583 = fieldWeight in 6627, product of:
3.7416575 = tf(freq=14.0), with freq of:
14.0 = termFreq=14.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0625 = fieldNorm(doc=6627)
0.5 = coord(6/12)
- Abstract
- Clustering of very large document databases is useful for both searching and browsing. The periodic updating of clusters is required due to the dynamic nature of databases. Introduces an algorithm for incremental clustering and discusses the complexity and cost of analysis of the algorithm together with an investigation of its expected behaviour. Shows through empirical testing that the algorithm achieves cost effectiveness and generates statistically valid clusters that are compatible with those of reclustering. The experimental evidence shows that the algorithm creates an effective and efficient retrieval environment
- Source
- ACM transactions on information systems. 11(1993) no.2, S.143-164
-
Newman, D.J.; Block, S.: Probabilistic topic decomposition of an eighteenth-century American newspaper (2006)
0.05
0.053947844 = product of:
0.10789569 = sum of:
0.0118385535 = weight(_text_:information in 5291) [ClassicSimilarity], result of:
0.0118385535 = score(doc=5291,freq=4.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.1920054 = fieldWeight in 5291, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=5291)
0.013542145 = weight(_text_:for in 5291) [ClassicSimilarity], result of:
0.013542145 = score(doc=5291,freq=4.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.20535621 = fieldWeight in 5291, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.0546875 = fieldNorm(doc=5291)
0.02242712 = weight(_text_:the in 5291) [ClassicSimilarity], result of:
0.02242712 = score(doc=5291,freq=22.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.40470776 = fieldWeight in 5291, product of:
4.690416 = tf(freq=22.0), with freq of:
22.0 = termFreq=22.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0546875 = fieldNorm(doc=5291)
0.021005431 = weight(_text_:of in 5291) [ClassicSimilarity], result of:
0.021005431 = score(doc=5291,freq=20.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.38244802 = fieldWeight in 5291, product of:
4.472136 = tf(freq=20.0), with freq of:
20.0 = termFreq=20.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.0546875 = fieldNorm(doc=5291)
0.02242712 = weight(_text_:the in 5291) [ClassicSimilarity], result of:
0.02242712 = score(doc=5291,freq=22.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.40470776 = fieldWeight in 5291, product of:
4.690416 = tf(freq=22.0), with freq of:
22.0 = termFreq=22.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0546875 = fieldNorm(doc=5291)
0.016655317 = product of:
0.033310633 = sum of:
0.033310633 = weight(_text_:22 in 5291) [ClassicSimilarity], result of:
0.033310633 = score(doc=5291,freq=2.0), product of:
0.12299426 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.035122856 = queryNorm
0.2708308 = fieldWeight in 5291, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=5291)
0.5 = coord(1/2)
0.5 = coord(6/12)
- Abstract
- We use a probabilistic mixture decomposition method to determine topics in the Pennsylvania Gazette, a major colonial U.S. newspaper from 1728-1800. We assess the value of several topic decomposition techniques for historical research and compare the accuracy and efficacy of various methods. After determining the topics covered by the 80,000 articles and advertisements in the entire 18th century run of the Gazette, we calculate how the prevalence of those topics changed over time, and give historically relevant examples of our findings. This approach reveals important information about the content of this colonial newspaper, and suggests the value of such approaches to a more complete understanding of early American print culture and society.
- Date
- 22. 7.2006 17:32:00
- Source
- Journal of the American Society for Information Science and Technology. 57(2006) no.6, S.753-767
-
Dresler, W.: Semi-automatische Indexierungssoftware : Möglichkeiten und Grenzen am Beispiel von g.a.d.t.1 (1998)
0.05
0.05090537 = product of:
0.20362148 = sum of:
0.020294663 = weight(_text_:information in 4272) [ClassicSimilarity], result of:
0.020294663 = score(doc=4272,freq=4.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.3291521 = fieldWeight in 4272, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.09375 = fieldNorm(doc=4272)
0.039620515 = weight(_text_:und in 4272) [ClassicSimilarity], result of:
0.039620515 = score(doc=4272,freq=6.0), product of:
0.07784514 = queryWeight, product of:
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.035122856 = queryNorm
0.5089658 = fieldWeight in 4272, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.09375 = fieldNorm(doc=4272)
0.14370629 = weight(_text_:dokumentation in 4272) [ClassicSimilarity], result of:
0.14370629 = score(doc=4272,freq=4.0), product of:
0.16407113 = queryWeight, product of:
4.671349 = idf(docFreq=1124, maxDocs=44218)
0.035122856 = queryNorm
0.875878 = fieldWeight in 4272, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
4.671349 = idf(docFreq=1124, maxDocs=44218)
0.09375 = fieldNorm(doc=4272)
0.25 = coord(3/12)
- Footnote
- Abschlussarbeit am Institut für Information und Dokumentation an der Fachhochschule Potsdam
- Imprint
- Potsdam : Fachhochschule, Institut für Information und Dokumentation
-
Weidenbach, N.: Werkzeuge zur Evaluierung und Optimierung von Regeln zur Automatischen Indexierung : Anwendungssystementwicklung (1994)
0.05
0.049438734 = product of:
0.19775493 = sum of:
0.019133992 = weight(_text_:information in 2768) [ClassicSimilarity], result of:
0.019133992 = score(doc=2768,freq=2.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.3103276 = fieldWeight in 2768, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.125 = fieldNorm(doc=2768)
0.043133352 = weight(_text_:und in 2768) [ClassicSimilarity], result of:
0.043133352 = score(doc=2768,freq=4.0), product of:
0.07784514 = queryWeight, product of:
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.035122856 = queryNorm
0.55409175 = fieldWeight in 2768, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.125 = fieldNorm(doc=2768)
0.13548759 = weight(_text_:dokumentation in 2768) [ClassicSimilarity], result of:
0.13548759 = score(doc=2768,freq=2.0), product of:
0.16407113 = queryWeight, product of:
4.671349 = idf(docFreq=1124, maxDocs=44218)
0.035122856 = queryNorm
0.82578564 = fieldWeight in 2768, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.671349 = idf(docFreq=1124, maxDocs=44218)
0.125 = fieldNorm(doc=2768)
0.25 = coord(3/12)
- Imprint
- Darmstadt : Fachhochschule, Fachbereich Information und Dokumentation
-
Riloff, E.: ¬An empirical study of automated dictionary construction for information extraction in three domains (1996)
0.05
0.048104133 = product of:
0.09620827 = sum of:
0.016570523 = weight(_text_:information in 6752) [ClassicSimilarity], result of:
0.016570523 = score(doc=6752,freq=6.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.2687516 = fieldWeight in 6752, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0625 = fieldNorm(doc=6752)
0.018955056 = weight(_text_:for in 6752) [ClassicSimilarity], result of:
0.018955056 = score(doc=6752,freq=6.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.28743884 = fieldWeight in 6752, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.0625 = fieldNorm(doc=6752)
0.01545607 = weight(_text_:the in 6752) [ClassicSimilarity], result of:
0.01545607 = score(doc=6752,freq=8.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.27891195 = fieldWeight in 6752, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0625 = fieldNorm(doc=6752)
0.010735902 = weight(_text_:of in 6752) [ClassicSimilarity], result of:
0.010735902 = score(doc=6752,freq=4.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.19546966 = fieldWeight in 6752, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.0625 = fieldNorm(doc=6752)
0.01545607 = weight(_text_:the in 6752) [ClassicSimilarity], result of:
0.01545607 = score(doc=6752,freq=8.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.27891195 = fieldWeight in 6752, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0625 = fieldNorm(doc=6752)
0.019034648 = product of:
0.038069297 = sum of:
0.038069297 = weight(_text_:22 in 6752) [ClassicSimilarity], result of:
0.038069297 = score(doc=6752,freq=2.0), product of:
0.12299426 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.035122856 = queryNorm
0.30952093 = fieldWeight in 6752, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0625 = fieldNorm(doc=6752)
0.5 = coord(1/2)
0.5 = coord(6/12)
- Abstract
- AutoSlog is a system that addresses the knowledge engineering bottleneck for information extraction. AutoSlog automatically creates domain specific dictionaries for information extraction, given an appropriate training corpus. Describes experiments with AutoSlog in terrorism, joint ventures and microelectronics domains. Compares the performance of AutoSlog across the 3 domains, discusses the lessons learned and presents results from 2 experiments which demonstrate that novice users can generate effective dictionaries using AutoSlog
- Date
- 6. 3.1997 16:22:15
-
Ward, M.L.: ¬The future of the human indexer (1996)
0.05
0.04705239 = product of:
0.09410478 = sum of:
0.007175247 = weight(_text_:information in 7244) [ClassicSimilarity], result of:
0.007175247 = score(doc=7244,freq=2.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.116372846 = fieldWeight in 7244, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.046875 = fieldNorm(doc=7244)
0.014216291 = weight(_text_:for in 7244) [ClassicSimilarity], result of:
0.014216291 = score(doc=7244,freq=6.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.21557912 = fieldWeight in 7244, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.046875 = fieldNorm(doc=7244)
0.021686744 = weight(_text_:the in 7244) [ClassicSimilarity], result of:
0.021686744 = score(doc=7244,freq=28.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.39134735 = fieldWeight in 7244, product of:
5.2915025 = tf(freq=28.0), with freq of:
28.0 = termFreq=28.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.046875 = fieldNorm(doc=7244)
0.015063776 = weight(_text_:of in 7244) [ClassicSimilarity], result of:
0.015063776 = score(doc=7244,freq=14.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.2742677 = fieldWeight in 7244, product of:
3.7416575 = tf(freq=14.0), with freq of:
14.0 = termFreq=14.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.046875 = fieldNorm(doc=7244)
0.021686744 = weight(_text_:the in 7244) [ClassicSimilarity], result of:
0.021686744 = score(doc=7244,freq=28.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.39134735 = fieldWeight in 7244, product of:
5.2915025 = tf(freq=28.0), with freq of:
28.0 = termFreq=28.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.046875 = fieldNorm(doc=7244)
0.014275986 = product of:
0.028551972 = sum of:
0.028551972 = weight(_text_:22 in 7244) [ClassicSimilarity], result of:
0.028551972 = score(doc=7244,freq=2.0), product of:
0.12299426 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.035122856 = queryNorm
0.23214069 = fieldWeight in 7244, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=7244)
0.5 = coord(1/2)
0.5 = coord(6/12)
- Abstract
- Considers the principles of indexing and the intellectual skills involved in order to determine what automatic indexing systems would be required in order to supplant or complement the human indexer. Good indexing requires: considerable prior knowledge of the literature; judgement as to what to index and what depth to index; reading skills; abstracting skills; and classification skills. Illustrates these features with a detailed description of abstracting and indexing processes involved in generating entries for the mechanical engineering database POWERLINK. Briefly assesses the possibility of replacing human indexers with specialist indexing software, with particular reference to the Object Analyzer from the InTEXT automatic indexing system and using the criteria described for human indexers. At present, it is unlikely that the automatic indexer will replace the human indexer, but when more primary texts are available in electronic form, it may be a useful productivity tool for dealing with large quantities of low grade texts (should they be wanted in the database)
- Date
- 9. 2.1997 18:44:22
- Source
- Journal of librarianship and information science. 28(1996) no.4, S.217-225
-
Klinger, K.-H.: Automatische Inhaltserschließung einer Volltextdatenbank : Machbarkeitsstudie am Beispiel der FAZ (1994)
0.05
0.046280365 = product of:
0.18512146 = sum of:
0.019133992 = weight(_text_:information in 2766) [ClassicSimilarity], result of:
0.019133992 = score(doc=2766,freq=2.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.3103276 = fieldWeight in 2766, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.125 = fieldNorm(doc=2766)
0.030499885 = weight(_text_:und in 2766) [ClassicSimilarity], result of:
0.030499885 = score(doc=2766,freq=2.0), product of:
0.07784514 = queryWeight, product of:
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.035122856 = queryNorm
0.39180204 = fieldWeight in 2766, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.125 = fieldNorm(doc=2766)
0.13548759 = weight(_text_:dokumentation in 2766) [ClassicSimilarity], result of:
0.13548759 = score(doc=2766,freq=2.0), product of:
0.16407113 = queryWeight, product of:
4.671349 = idf(docFreq=1124, maxDocs=44218)
0.035122856 = queryNorm
0.82578564 = fieldWeight in 2766, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.671349 = idf(docFreq=1124, maxDocs=44218)
0.125 = fieldNorm(doc=2766)
0.25 = coord(3/12)
- Imprint
- Darmstadt : Fachhochschule, Fachbereich Information und Dokumentation
-
Lepsky, K.: Automatische Indexierung (2012)
0.05
0.046280365 = product of:
0.18512146 = sum of:
0.019133992 = weight(_text_:information in 442) [ClassicSimilarity], result of:
0.019133992 = score(doc=442,freq=2.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.3103276 = fieldWeight in 442, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.125 = fieldNorm(doc=442)
0.030499885 = weight(_text_:und in 442) [ClassicSimilarity], result of:
0.030499885 = score(doc=442,freq=2.0), product of:
0.07784514 = queryWeight, product of:
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.035122856 = queryNorm
0.39180204 = fieldWeight in 442, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.125 = fieldNorm(doc=442)
0.13548759 = weight(_text_:dokumentation in 442) [ClassicSimilarity], result of:
0.13548759 = score(doc=442,freq=2.0), product of:
0.16407113 = queryWeight, product of:
4.671349 = idf(docFreq=1124, maxDocs=44218)
0.035122856 = queryNorm
0.82578564 = fieldWeight in 442, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.671349 = idf(docFreq=1124, maxDocs=44218)
0.125 = fieldNorm(doc=442)
0.25 = coord(3/12)
- Source
- Grundlagen der praktischen Information und Dokumentation. 6. Aufl
-
Li, W.; Wong, K.-F.; Yuan, C.: Toward automatic Chinese temporal information extraction (2001)
0.05
0.04605698 = product of:
0.09211396 = sum of:
0.010274758 = product of:
0.030824272 = sum of:
0.030824272 = weight(_text_:f in 6029) [ClassicSimilarity], result of:
0.030824272 = score(doc=6029,freq=2.0), product of:
0.13999219 = queryWeight, product of:
3.985786 = idf(docFreq=2232, maxDocs=44218)
0.035122856 = queryNorm
0.22018565 = fieldWeight in 6029, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.985786 = idf(docFreq=2232, maxDocs=44218)
0.0390625 = fieldNorm(doc=6029)
0.33333334 = coord(1/3)
0.014646411 = weight(_text_:information in 6029) [ClassicSimilarity], result of:
0.014646411 = score(doc=6029,freq=12.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.23754507 = fieldWeight in 6029, product of:
3.4641016 = tf(freq=12.0), with freq of:
12.0 = termFreq=12.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0390625 = fieldNorm(doc=6029)
0.01675406 = weight(_text_:for in 6029) [ClassicSimilarity], result of:
0.01675406 = score(doc=6029,freq=12.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.2540624 = fieldWeight in 6029, product of:
3.4641016 = tf(freq=12.0), with freq of:
12.0 = termFreq=12.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.0390625 = fieldNorm(doc=6029)
0.01991469 = weight(_text_:the in 6029) [ClassicSimilarity], result of:
0.01991469 = score(doc=6029,freq=34.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.3593698 = fieldWeight in 6029, product of:
5.8309517 = tf(freq=34.0), with freq of:
34.0 = termFreq=34.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0390625 = fieldNorm(doc=6029)
0.010609345 = weight(_text_:of in 6029) [ClassicSimilarity], result of:
0.010609345 = score(doc=6029,freq=10.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.19316542 = fieldWeight in 6029, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.0390625 = fieldNorm(doc=6029)
0.01991469 = weight(_text_:the in 6029) [ClassicSimilarity], result of:
0.01991469 = score(doc=6029,freq=34.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.3593698 = fieldWeight in 6029, product of:
5.8309517 = tf(freq=34.0), with freq of:
34.0 = termFreq=34.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0390625 = fieldNorm(doc=6029)
0.5 = coord(6/12)
- Abstract
- Over the past few years, temporal information processing and temporal database management have increasingly become hot topics. Nevertheless, only a few researchers have investigated these areas in the Chinese language. This lays down the objective of our research: to exploit Chinese language processing techniques for temporal information extraction and concept reasoning. In this article, we first study the mechanism for expressing time in Chinese. On the basis of the study, we then design a general frame structure for maintaining the extracted temporal concepts and propose a system for extracting time-dependent information from Hong Kong financial news. In the system, temporal knowledge is represented by different types of temporal concepts (TTC) and different temporal relations, including absolute and relative relations, which are used to correlate between action times and reference times. In analyzing a sentence, the algorithm first determines the situation related to the verb. This in turn will identify the type of temporal concept associated with the verb. After that, the relevant temporal information is extracted and the temporal relations are derived. These relations link relevant concept frames together in chronological order, which in turn provide the knowledge to fulfill users' queries, e.g., for question-answering (i.e., Q&A) applications
- Source
- Journal of the American Society for Information Science and technology. 52(2001) no.9, S.748-762
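This record contains a nested clause: the weight for `f` is first scaled by its own coord(1/3) before it joins the outer sum, and the `the` clause is counted twice. The sketch below shows how the pieces combine, assuming the classic Lucene TF-IDF formulas that match the printed values; `classic_term_score` and the other names are illustrative only.

```python
import math

def classic_term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    """One 'weight(_text_:term in doc)' leaf: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))
    return (idf * query_norm) * (tf * idf * field_norm)

# Values copied from the listing for doc 6029.
MAX_DOCS = 44218
QUERY_NORM = 0.035122856
FIELD_NORM = 0.0390625

def leaf(freq, doc_freq):
    return classic_term_score(freq, doc_freq, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# Nested clause: one of three sub-clauses matched, hence coord(1/3).
f_clause = leaf(2.0, 2232) * (1 / 3)

# Top-level clauses; 'the' appears twice in the tree, so it is added twice.
top_level = [
    f_clause,
    leaf(12.0, 20772),  # information
    leaf(12.0, 18385),  # for
    leaf(34.0, 24812),  # the
    leaf(10.0, 25162),  # of
    leaf(34.0, 24812),  # the (second occurrence)
]

# coord(6/12): 6 of the 12 query clauses matched this document.
total = sum(top_level) * (6 / 12)

print(f"f clause after coord(1/3): {f_clause:.9f}")
print(f"total: {total:.8f}")
```

The same pattern applies to the other nested clauses in the listing, such as those scaled by coord(1/2).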
-
Lepsky, K.; Siepmann, J.; Zimmermann, A.: Automatische Indexierung für Online-Kataloge : Ergebnisse eines Retrievaltests (1996)
0.05
0.045498945 = product of:
0.09099789 = sum of:
0.008371122 = weight(_text_:information in 3251) [ClassicSimilarity], result of:
0.008371122 = score(doc=3251,freq=2.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.13576832 = fieldWeight in 3251, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=3251)
0.013343699 = weight(_text_:und in 3251) [ClassicSimilarity], result of:
0.013343699 = score(doc=3251,freq=2.0), product of:
0.07784514 = queryWeight, product of:
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.035122856 = queryNorm
0.17141339 = fieldWeight in 3251, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.0546875 = fieldNorm(doc=3251)
0.009575742 = weight(_text_:for in 3251) [ClassicSimilarity], result of:
0.009575742 = score(doc=3251,freq=2.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.14520876 = fieldWeight in 3251, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.0546875 = fieldNorm(doc=3251)
0.02242712 = weight(_text_:the in 3251) [ClassicSimilarity], result of:
0.02242712 = score(doc=3251,freq=22.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.40470776 = fieldWeight in 3251, product of:
4.690416 = tf(freq=22.0), with freq of:
22.0 = termFreq=22.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0546875 = fieldNorm(doc=3251)
0.0148530835 = weight(_text_:of in 3251) [ClassicSimilarity], result of:
0.0148530835 = score(doc=3251,freq=10.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.2704316 = fieldWeight in 3251, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.0546875 = fieldNorm(doc=3251)
0.02242712 = weight(_text_:the in 3251) [ClassicSimilarity], result of:
0.02242712 = score(doc=3251,freq=22.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.40470776 = fieldWeight in 3251, product of:
4.690416 = tf(freq=22.0), with freq of:
22.0 = termFreq=22.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0546875 = fieldNorm(doc=3251)
0.5 = coord(6/12)
- Abstract
- Examines the effectiveness of automated indexing and presents the results of a study of information retrieval from a segment (40,000 items) of the ULB Düsseldorf database. The segment was selected randomly and all the documents included were indexed automatically. The search topics included 50 subject areas ranging from economic growth to alternative energy sources. While there were 876 relevant documents in the database segment across the 50 search topics, the recall ranged from 1 to 244 references, with the average being 17.52 documents per topic. Therefore it seems that, in the immediate future, automatic indexing should be used in combination with intellectual indexing
- Source
- Zeitschrift für Bibliothekswesen und Bibliographie. 43(1996) H.1, S.47-56
-
Kim, P.K.: ¬An automatic indexing of compound words based on mutual information for Korean text retrieval (1995)
0.05
0.045104995 = product of:
0.10825199 = sum of:
0.019133992 = weight(_text_:information in 620) [ClassicSimilarity], result of:
0.019133992 = score(doc=620,freq=8.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.3103276 = fieldWeight in 620, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0625 = fieldNorm(doc=620)
0.026806494 = weight(_text_:for in 620) [ClassicSimilarity], result of:
0.026806494 = score(doc=620,freq=12.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.40649986 = fieldWeight in 620, product of:
3.4641016 = tf(freq=12.0), with freq of:
12.0 = termFreq=12.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.0625 = fieldNorm(doc=620)
0.021858184 = weight(_text_:the in 620) [ClassicSimilarity], result of:
0.021858184 = score(doc=620,freq=16.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.39444107 = fieldWeight in 620, product of:
4.0 = tf(freq=16.0), with freq of:
16.0 = termFreq=16.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0625 = fieldNorm(doc=620)
0.018595127 = weight(_text_:of in 620) [ClassicSimilarity], result of:
0.018595127 = score(doc=620,freq=12.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.33856338 = fieldWeight in 620, product of:
3.4641016 = tf(freq=12.0), with freq of:
12.0 = termFreq=12.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.0625 = fieldNorm(doc=620)
0.021858184 = weight(_text_:the in 620) [ClassicSimilarity], result of:
0.021858184 = score(doc=620,freq=16.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.39444107 = fieldWeight in 620, product of:
4.0 = tf(freq=16.0), with freq of:
16.0 = termFreq=16.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0625 = fieldNorm(doc=620)
0.41666666 = coord(5/12)
- Abstract
- Presents an automatic indexing technique for compound words suitable for an agglutinative language, specifically Korean. Discusses some construction conditions for compound words and the rules for decomposing compound words to enhance the exhaustivity of indexing, demonstrating that this system, based on mutual information, enhances both the exhaustivity of indexing and the specificity of terms. Suggests that the construction conditions and rules for decomposition presented may be used in multilingual information retrieval systems to translate the indexing terms of the specific language into those of the language required
- Source
- Library and information science. 1995, no.34, S.29-38
-
Polity, Y.: Vers une ergonomie linguistique (1994)
0.04
0.0446906 = product of:
0.0893812 = sum of:
0.016439613 = product of:
0.049318835 = sum of:
0.049318835 = weight(_text_:f in 36) [ClassicSimilarity], result of:
0.049318835 = score(doc=36,freq=2.0), product of:
0.13999219 = queryWeight, product of:
3.985786 = idf(docFreq=2232, maxDocs=44218)
0.035122856 = queryNorm
0.35229704 = fieldWeight in 36, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.985786 = idf(docFreq=2232, maxDocs=44218)
0.0625 = fieldNorm(doc=36)
0.33333334 = coord(1/3)
0.021392453 = weight(_text_:information in 36) [ClassicSimilarity], result of:
0.021392453 = score(doc=36,freq=10.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.3469568 = fieldWeight in 36, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0625 = fieldNorm(doc=36)
0.018955056 = weight(_text_:for in 36) [ClassicSimilarity], result of:
0.018955056 = score(doc=36,freq=6.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.28743884 = fieldWeight in 36, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.0625 = fieldNorm(doc=36)
0.010929092 = weight(_text_:the in 36) [ClassicSimilarity], result of:
0.010929092 = score(doc=36,freq=4.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.19722053 = fieldWeight in 36, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0625 = fieldNorm(doc=36)
0.010735902 = weight(_text_:of in 36) [ClassicSimilarity], result of:
0.010735902 = score(doc=36,freq=4.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.19546966 = fieldWeight in 36, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.0625 = fieldNorm(doc=36)
0.010929092 = weight(_text_:the in 36) [ClassicSimilarity], result of:
0.010929092 = score(doc=36,freq=4.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.19722053 = fieldWeight in 36, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0625 = fieldNorm(doc=36)
0.5 = coord(6/12)
- Abstract
- Analyzes a special type of man-machine interaction, that of searching an information system with natural language. Proposes a model for full text processing for information retrieval that considers the system's users and how they employ information. Describes how INIST (the National Institute for Scientific and Technical Information) is developing computer assisted indexing as an aid to improving relevance when retrieving information from bibliographic data banks
- Language
- f
-
Croft, W.B.: Clustering large files of documents using the single link method (1977)
0.04
0.04425399 = product of:
0.106209576 = sum of:
0.019133992 = weight(_text_:information in 5489) [ClassicSimilarity], result of:
0.019133992 = score(doc=5489,freq=2.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.3103276 = fieldWeight in 5489, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.125 = fieldNorm(doc=5489)
0.02188741 = weight(_text_:for in 5489) [ClassicSimilarity], result of:
0.02188741 = score(doc=5489,freq=2.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.33190575 = fieldWeight in 5489, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.125 = fieldNorm(doc=5489)
0.021858184 = weight(_text_:the in 5489) [ClassicSimilarity], result of:
0.021858184 = score(doc=5489,freq=4.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.39444107 = fieldWeight in 5489, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.125 = fieldNorm(doc=5489)
0.021471804 = weight(_text_:of in 5489) [ClassicSimilarity], result of:
0.021471804 = score(doc=5489,freq=4.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.39093933 = fieldWeight in 5489, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.125 = fieldNorm(doc=5489)
0.021858184 = weight(_text_:the in 5489) [ClassicSimilarity], result of:
0.021858184 = score(doc=5489,freq=4.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.39444107 = fieldWeight in 5489, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.125 = fieldNorm(doc=5489)
0.41666666 = coord(5/12)
- Source
- Journal of the American Society for Information Science. 28(1977), S.341-344
-
Chou, C.; Chu, T.: ¬An analysis of BERT (NLP) for assisted subject indexing for Project Gutenberg (2022)
0.04
0.04382915 = product of:
0.105189964 = sum of:
0.019151485 = weight(_text_:for in 1139) [ClassicSimilarity], result of:
0.019151485 = score(doc=1139,freq=8.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.29041752 = fieldWeight in 1139, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.0546875 = fieldNorm(doc=1139)
0.01512036 = weight(_text_:the in 1139) [ClassicSimilarity], result of:
0.01512036 = score(doc=1139,freq=10.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.2728539 = fieldWeight in 1139, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0546875 = fieldNorm(doc=1139)
0.017574405 = weight(_text_:of in 1139) [ClassicSimilarity], result of:
0.017574405 = score(doc=1139,freq=14.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.31997898 = fieldWeight in 1139, product of:
3.7416575 = tf(freq=14.0), with freq of:
14.0 = termFreq=14.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.0546875 = fieldNorm(doc=1139)
0.01512036 = weight(_text_:the in 1139) [ClassicSimilarity], result of:
0.01512036 = score(doc=1139,freq=10.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.2728539 = fieldWeight in 1139, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0546875 = fieldNorm(doc=1139)
0.038223356 = product of:
0.07644671 = sum of:
0.07644671 = weight(_text_:communities in 1139) [ClassicSimilarity], result of:
0.07644671 = score(doc=1139,freq=2.0), product of:
0.18632571 = queryWeight, product of:
5.3049703 = idf(docFreq=596, maxDocs=44218)
0.035122856 = queryNorm
0.41028535 = fieldWeight in 1139, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
5.3049703 = idf(docFreq=596, maxDocs=44218)
0.0546875 = fieldNorm(doc=1139)
0.5 = coord(1/2)
0.41666666 = coord(5/12)
- Abstract
- In light of AI (Artificial Intelligence) and NLP (Natural language processing) technologies, this article examines the feasibility of using AI/NLP models to enhance the subject indexing of digital resources. While BERT (Bidirectional Encoder Representations from Transformers) models are widely used in scholarly communities, the authors assess whether BERT models can be used in machine-assisted indexing in the Project Gutenberg collection, through suggesting Library of Congress subject headings filtered by certain Library of Congress Classification subclass labels. The findings of this study are informative for further research on BERT models to assist with automatic subject indexing for digital library collections.
-
Dattola, R.T.: FIRST: Flexible information retrieval system for text (1979)
0.04
0.043378346 = product of:
0.10410803 = sum of:
0.02705955 = weight(_text_:information in 5172) [ClassicSimilarity], result of:
0.02705955 = score(doc=5172,freq=4.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.43886948 = fieldWeight in 5172, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.125 = fieldNorm(doc=5172)
0.030953474 = weight(_text_:for in 5172) [ClassicSimilarity], result of:
0.030953474 = score(doc=5172,freq=4.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.46938562 = fieldWeight in 5172, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.125 = fieldNorm(doc=5172)
0.01545607 = weight(_text_:the in 5172) [ClassicSimilarity], result of:
0.01545607 = score(doc=5172,freq=2.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.27891195 = fieldWeight in 5172, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.125 = fieldNorm(doc=5172)
0.015182858 = weight(_text_:of in 5172) [ClassicSimilarity], result of:
0.015182858 = score(doc=5172,freq=2.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.27643585 = fieldWeight in 5172, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.125 = fieldNorm(doc=5172)
0.01545607 = weight(_text_:the in 5172) [ClassicSimilarity], result of:
0.01545607 = score(doc=5172,freq=2.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.27891195 = fieldWeight in 5172, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.125 = fieldNorm(doc=5172)
0.41666666 = coord(5/12)
- Source
- Journal of the American Society for Information Science. 30(1979), S.9-14
-
Martins, A.L.; Souza, R.R.; Ribeiro de Mello, H.: ¬The use of noun phrases in information retrieval : proposing a mechanism for automatic classification (2014)
0.04
0.042970788 = product of:
0.085941575 = sum of:
0.004783498 = weight(_text_:information in 1441) [ClassicSimilarity], result of:
0.004783498 = score(doc=1441,freq=2.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.0775819 = fieldWeight in 1441, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03125 = fieldNorm(doc=1441)
0.012235435 = weight(_text_:for in 1441) [ClassicSimilarity], result of:
0.012235435 = score(doc=1441,freq=10.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.18554096 = fieldWeight in 1441, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.03125 = fieldNorm(doc=1441)
0.022859836 = weight(_text_:the in 1441) [ClassicSimilarity], result of:
0.022859836 = score(doc=1441,freq=70.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.41251633 = fieldWeight in 1441, product of:
8.3666 = tf(freq=70.0), with freq of:
70.0 = termFreq=70.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.03125 = fieldNorm(doc=1441)
0.013685644 = weight(_text_:of in 1441) [ClassicSimilarity], result of:
0.013685644 = score(doc=1441,freq=26.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.2491759 = fieldWeight in 1441, product of:
5.0990195 = tf(freq=26.0), with freq of:
26.0 = termFreq=26.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.03125 = fieldNorm(doc=1441)
0.022859836 = weight(_text_:the in 1441) [ClassicSimilarity], result of:
0.022859836 = score(doc=1441,freq=70.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.41251633 = fieldWeight in 1441, product of:
8.3666 = tf(freq=70.0), with freq of:
70.0 = termFreq=70.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.03125 = fieldNorm(doc=1441)
0.009517324 = product of:
0.019034648 = sum of:
0.019034648 = weight(_text_:22 in 1441) [ClassicSimilarity], result of:
0.019034648 = score(doc=1441,freq=2.0), product of:
0.12299426 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.035122856 = queryNorm
0.15476047 = fieldWeight in 1441, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.03125 = fieldNorm(doc=1441)
0.5 = coord(1/2)
0.5 = coord(6/12)
- Abstract
- This paper presents research on syntactic structures known as noun phrases (NP), applied to increase the effectiveness and efficiency of document classification mechanisms. Our hypothesis is that NPs can be used instead of single words as a semantic aggregator to reduce the number of words used by the classification system without losing semantic coverage, thereby increasing its efficiency. The experiment divided the document classification process into three phases: a) NP preprocessing; b) system training; and c) classification experiments. In the first phase, a corpus of digitized texts was submitted to a natural language processing platform in which part-of-speech tagging was done, and then PERL scripts pertaining to the PALAVRAS package were used to extract the noun phrases. The preprocessing also involved the tasks of a) removing low-meaning NP pre-modifiers, such as quantifiers; b) identifying synonyms and substituting them with common hyperonyms; and c) stemming the relevant words contained in the NP, for similarity checking with other NPs. The first tests with the resulting documents have demonstrated its effectiveness. We compared the structural similarity of the documents before and after all the preprocessing steps of phase one. The texts remained consistent with the originals and kept their readability. The second phase involves submitting the modified documents to an SVM algorithm to identify clusters and classify the documents. The classification rules are to be established using a machine learning approach. Finally, tests will be conducted to check the effectiveness of the whole process.
- Source
- Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
-
Advances in intelligent retrieval: Proc. of a conference ... Wadham College, Oxford, 16.-17.4.1985 (1986)
0.04
0.042321783 = product of:
0.084643565 = sum of:
0.017436842 = product of:
0.052310523 = sum of:
0.052310523 = weight(_text_:f in 1384) [ClassicSimilarity], result of:
0.052310523 = score(doc=1384,freq=4.0), product of:
0.13999219 = queryWeight, product of:
3.985786 = idf(docFreq=2232, maxDocs=44218)
0.035122856 = queryNorm
0.37366742 = fieldWeight in 1384, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.985786 = idf(docFreq=2232, maxDocs=44218)
0.046875 = fieldNorm(doc=1384)
0.33333334 = coord(1/3)
0.016044341 = weight(_text_:information in 1384) [ClassicSimilarity], result of:
0.016044341 = score(doc=1384,freq=10.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.2602176 = fieldWeight in 1384, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.046875 = fieldNorm(doc=1384)
0.018353151 = weight(_text_:for in 1384) [ClassicSimilarity], result of:
0.018353151 = score(doc=1384,freq=10.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.27831143 = fieldWeight in 1384, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.046875 = fieldNorm(doc=1384)
0.010039012 = weight(_text_:the in 1384) [ClassicSimilarity], result of:
0.010039012 = score(doc=1384,freq=6.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.18115863 = fieldWeight in 1384, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.046875 = fieldNorm(doc=1384)
0.012731214 = weight(_text_:of in 1384) [ClassicSimilarity], result of:
0.012731214 = score(doc=1384,freq=10.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.23179851 = fieldWeight in 1384, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.046875 = fieldNorm(doc=1384)
0.010039012 = weight(_text_:the in 1384) [ClassicSimilarity], result of:
0.010039012 = score(doc=1384,freq=6.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.18115863 = fieldWeight in 1384, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.046875 = fieldNorm(doc=1384)
0.5 = coord(6/12)
- Content
- Contains the contributions: ADDIS, T.: Extended relational analysis: a design approach to knowledge-based systems; PARKINSON, D.: Supercomputers and non-numeric processing; McGREGOR, D.R. and J.R. MALONE: An architectural approach to advances in information retrieval; ALLEN, M.J. and O.S. HARRISON: Word processing and information retrieval: some practical problems; MURTAGH, F.: Clustering and nearest neighborhood searching; ENSER, P.G.B.: Experimenting with the automatic classification of books; TESKEY, N. and Z. RAZAK: An analysis of ranking for free text retrieval systems; ZARRI, G.P.: Interactive information retrieval: an artificial intelligence approach to deal with biographical data; HANCOX, P. and F. SMITH: A case system processor for the PRECIS indexing language; ROUAULT, J.: Linguistic methods in information retrieval systems; ARAGON-RAMIREZ, V. and C.D. PAICE: Design of a system for the online elucidation of natural language search statements; BROOKS, H.M., P.J. DANIELS and N.J. BELKIN: Problem descriptions and user models: developing an intelligent interface for document retrieval systems; BLACK, W.J., P. HARGREAVES and P.B. MAYES: HEADS: a cataloguing advisory system; BELL, D.A.: An architecture for integrating data, knowledge, and information bases