-
Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986)
0.10
0.10299414 = product of:
0.20598827 = sum of:
0.025915671 = weight(_text_:information in 402) [ClassicSimilarity], result of:
0.025915671 = score(doc=402,freq=2.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.3103276 = fieldWeight in 402, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.125 = fieldNorm(doc=402)
0.1800726 = sum of:
0.07694813 = weight(_text_:retrieval in 402) [ClassicSimilarity], result of:
0.07694813 = score(doc=402,freq=2.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.5347345 = fieldWeight in 402, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.125 = fieldNorm(doc=402)
0.10312447 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
0.10312447 = score(doc=402,freq=2.0), product of:
0.16658723 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.047571484 = queryNorm
0.61904186 = fieldWeight in 402, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.125 = fieldNorm(doc=402)
0.5 = coord(2/4)
- Source
- Information processing and management. 22(1986) no.6, S.465-476
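
The score breakdown for this record (doc 402) can be recomputed from the listed inputs. The following is a minimal sketch, not the search engine's own code: it assumes the classic TF-IDF definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the listed idf values. The function names `idf` and `term_score` are hypothetical, chosen for illustration only.

```python
import math

MAX_DOCS = 44218          # maxDocs, as listed in the explain output
QUERY_NORM = 0.047571484  # queryNorm, as listed in the explain output


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed idf definition; it reproduces the listed values (e.g. 1.7554779).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm, query_norm=QUERY_NORM):
    # score = queryWeight * fieldWeight, following the listed breakdown.
    term_idf = idf(doc_freq)
    query_weight = term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# Inputs copied from the explain tree of doc 402.
information = term_score(freq=2.0, doc_freq=20772, field_norm=0.125)
retrieval = term_score(freq=2.0, doc_freq=5836, field_norm=0.125)
twenty_two = term_score(freq=2.0, doc_freq=3622, field_norm=0.125)

# Sum the clauses, then apply coord(2/4) = 0.5.
total = (information + (retrieval + twenty_two)) * 0.5

# Should agree with the listed 0.10299414 up to float rounding.
print(round(total, 6))
```

The listed values come from single-precision arithmetic, so the double-precision result here may differ in the last digits.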
-
Biebricher, N.; Fuhr, N.; Lustig, G.; Schwantner, M.; Knorz, G.: The automatic indexing system AIR/PHYS : from research to application (1988)
0.08
0.077686206 = product of:
0.15537241 = sum of:
0.022906432 = weight(_text_:information in 1952) [ClassicSimilarity], result of:
0.022906432 = score(doc=1952,freq=4.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.27429342 = fieldWeight in 1952, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.078125 = fieldNorm(doc=1952)
0.13246597 = sum of:
0.06801318 = weight(_text_:retrieval in 1952) [ClassicSimilarity], result of:
0.06801318 = score(doc=1952,freq=4.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.47264296 = fieldWeight in 1952, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.078125 = fieldNorm(doc=1952)
0.0644528 = weight(_text_:22 in 1952) [ClassicSimilarity], result of:
0.0644528 = score(doc=1952,freq=2.0), product of:
0.16658723 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.047571484 = queryNorm
0.38690117 = fieldWeight in 1952, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.078125 = fieldNorm(doc=1952)
0.5 = coord(2/4)
- Date
- 16. 8.1998 12:51:22
- Footnote
- Reprinted in: Readings in information retrieval. Ed.: K. Sparck Jones and P. Willett. San Francisco: Morgan Kaufmann 1997. S.513-517.
- Source
- Proceedings of the 11th annual conference on research and development in information retrieval. Ed.: Y. Chiaramella
-
Hodges, P.R.: Keyword in title indexes : effectiveness of retrieval in computer searches (1983)
0.06
0.06424054 = product of:
0.12848108 = sum of:
0.016034503 = weight(_text_:information in 5001) [ClassicSimilarity], result of:
0.016034503 = score(doc=5001,freq=4.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.1920054 = fieldWeight in 5001, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=5001)
0.11244657 = sum of:
0.067329615 = weight(_text_:retrieval in 5001) [ClassicSimilarity], result of:
0.067329615 = score(doc=5001,freq=8.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.46789268 = fieldWeight in 5001, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0546875 = fieldNorm(doc=5001)
0.045116954 = weight(_text_:22 in 5001) [ClassicSimilarity], result of:
0.045116954 = score(doc=5001,freq=2.0), product of:
0.16658723 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.047571484 = queryNorm
0.2708308 = fieldWeight in 5001, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=5001)
0.5 = coord(2/4)
- Abstract
- A study was done to test the effectiveness of retrieval using title word searching. It was based on actual search profiles used in the Mechanized Information Center at Ohio State University, in order to replicate actual searching conditions as closely as possible. Fewer than 50% of the relevant titles were retrieved by keywords in titles. The low rate of retrieval can be attributed to three sources: the titles themselves, user and information specialist ignorance of the subject vocabulary in use, and general language problems. Across fields, the social sciences had the best retrieval rate, followed by science, with arts and humanities the lowest. Ways to enhance and supplement keyword in title searching on the computer and in printed indexes are discussed.
- Date
- 14. 3.1996 13:22:21
-
Bordoni, L.; Pazienza, M.T.: Documents automatic indexing in an environmental domain (1997)
0.05
0.054380342 = product of:
0.108760685 = sum of:
0.016034503 = weight(_text_:information in 530) [ClassicSimilarity], result of:
0.016034503 = score(doc=530,freq=4.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.1920054 = fieldWeight in 530, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=530)
0.09272618 = sum of:
0.047609225 = weight(_text_:retrieval in 530) [ClassicSimilarity], result of:
0.047609225 = score(doc=530,freq=4.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.33085006 = fieldWeight in 530, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0546875 = fieldNorm(doc=530)
0.045116954 = weight(_text_:22 in 530) [ClassicSimilarity], result of:
0.045116954 = score(doc=530,freq=2.0), product of:
0.16658723 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.047571484 = queryNorm
0.2708308 = fieldWeight in 530, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=530)
0.5 = coord(2/4)
- Abstract
- Describes an application of natural language processing (NLP) techniques in HIRMA (Hypertextual Information Retrieval Managed by ARIOSTO) to the problem of document indexing. The system uses NLP techniques to determine the subject of the text of documents and to associate them with relevant semantic indexes. Briefly describes the overall system, details its implementation on a corpus of scientific abstracts related to environmental topics, and presents experimental evidence of the system's behaviour. Analyzes in detail an experiment designed to evaluate the system's retrieval ability in terms of recall and precision.
- Source
- International forum on information and documentation. 22(1997) no.1, S.17-28
-
Lepsky, K.; Vorhauer, J.: Lingo - ein open source System für die Automatische Indexierung deutschsprachiger Dokumente (2006)
0.05
0.05149707 = product of:
0.10299414 = sum of:
0.012957836 = weight(_text_:information in 3581) [ClassicSimilarity], result of:
0.012957836 = score(doc=3581,freq=2.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.1551638 = fieldWeight in 3581, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0625 = fieldNorm(doc=3581)
0.0900363 = sum of:
0.038474064 = weight(_text_:retrieval in 3581) [ClassicSimilarity], result of:
0.038474064 = score(doc=3581,freq=2.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.26736724 = fieldWeight in 3581, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0625 = fieldNorm(doc=3581)
0.051562235 = weight(_text_:22 in 3581) [ClassicSimilarity], result of:
0.051562235 = score(doc=3581,freq=2.0), product of:
0.16658723 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.047571484 = queryNorm
0.30952093 = fieldWeight in 3581, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0625 = fieldNorm(doc=3581)
0.5 = coord(2/4)
- Abstract
- Lingo is a freely available (open source) system for the automatic indexing of German-language text. Its development prioritized high configurability and flexibility for different application scenarios. The article demonstrates the benefit of linguistically based automatic indexing for information retrieval. The linguistic functionality that lingo offers for improving retrieval is presented and illustrated with examples: base-form recognition, compound recognition and compound decomposition, word relationing, lexical and algorithmic multi-word group recognition, and OCR error correction. The open system architecture of lingo is described, and possible application scenarios and limits of use are identified.
- Date
- 24. 3.2006 12:22:02
-
Jardine, N.; Rijsbergen, C.J. van: The use of hierarchic clustering in information retrieval (1971)
0.05
0.045530416 = product of:
0.09106083 = sum of:
0.036650293 = weight(_text_:information in 5170) [ClassicSimilarity], result of:
0.036650293 = score(doc=5170,freq=4.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.43886948 = fieldWeight in 5170, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.125 = fieldNorm(doc=5170)
0.054410543 = product of:
0.10882109 = sum of:
0.10882109 = weight(_text_:retrieval in 5170) [ClassicSimilarity], result of:
0.10882109 = score(doc=5170,freq=4.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.75622874 = fieldWeight in 5170, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.125 = fieldNorm(doc=5170)
0.5 = coord(1/2)
0.5 = coord(2/4)
- Source
- Information storage and retrieval. 7(1971), S.217-240
-
Sparck Jones, K.; Jackson, D.M.: The use of automatically obtained keyword classification for information retrieval (1970)
0.05
0.045530416 = product of:
0.09106083 = sum of:
0.036650293 = weight(_text_:information in 5177) [ClassicSimilarity], result of:
0.036650293 = score(doc=5177,freq=4.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.43886948 = fieldWeight in 5177, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.125 = fieldNorm(doc=5177)
0.054410543 = product of:
0.10882109 = sum of:
0.10882109 = weight(_text_:retrieval in 5177) [ClassicSimilarity], result of:
0.10882109 = score(doc=5177,freq=4.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.75622874 = fieldWeight in 5177, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.125 = fieldNorm(doc=5177)
0.5 = coord(1/2)
0.5 = coord(2/4)
- Source
- Information storage and retrieval. 5(1970), S.175-201
-
Kantor, P.B.; Voorhees, E.: Information retrieval with scanned texts (2000)
0.05
0.045530416 = product of:
0.09106083 = sum of:
0.036650293 = weight(_text_:information in 3901) [ClassicSimilarity], result of:
0.036650293 = score(doc=3901,freq=4.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.43886948 = fieldWeight in 3901, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.125 = fieldNorm(doc=3901)
0.054410543 = product of:
0.10882109 = sum of:
0.10882109 = weight(_text_:retrieval in 3901) [ClassicSimilarity], result of:
0.10882109 = score(doc=3901,freq=4.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.75622874 = fieldWeight in 3901, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.125 = fieldNorm(doc=3901)
0.5 = coord(1/2)
0.5 = coord(2/4)
- Source
- Information retrieval. 2(2000), S.165-176
-
Renz, M.: Automatische Inhaltserschließung im Zeichen von Wissensmanagement (2001)
0.05
0.045059934 = product of:
0.09011987 = sum of:
0.011338106 = weight(_text_:information in 5671) [ClassicSimilarity], result of:
0.011338106 = score(doc=5671,freq=2.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.13576832 = fieldWeight in 5671, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=5671)
0.07878176 = sum of:
0.033664808 = weight(_text_:retrieval in 5671) [ClassicSimilarity], result of:
0.033664808 = score(doc=5671,freq=2.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.23394634 = fieldWeight in 5671, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0546875 = fieldNorm(doc=5671)
0.045116954 = weight(_text_:22 in 5671) [ClassicSimilarity], result of:
0.045116954 = score(doc=5671,freq=2.0), product of:
0.16658723 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.047571484 = queryNorm
0.2708308 = fieldWeight in 5671, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=5671)
0.5 = coord(2/4)
- Abstract
- Methods of automatic subject indexing have been developed for more than 30 years without gaining noticeable acceptance in information and documentation (IuD) circles. At present, however, the growing flood of information and the need for efficient access methods in information and knowledge management are leading, among a broad range of users, to growing interest in these methods, to intensified research and development efforts, and to new products. This article discusses various approaches to intelligent, content-based retrieval and to automatic subject indexing, and presents commercially available software tools and solutions. It concludes that increasing automation of certain components of information and knowledge management can be expected in the near future, as software tools for automatic subject indexing are integrated into the workflow.
- Date
- 22. 3.2001 13:14:48
- Source
- nfd Information - Wissenschaft und Praxis. 52(2001) H.2, S.69-78
-
RIAO 91 : Computer aided information retrieval. Conference, Barcelona, 2.-4.5.1991 (1991)
0.04
0.040243585 = product of:
0.08048717 = sum of:
0.032394588 = weight(_text_:information in 4651) [ClassicSimilarity], result of:
0.032394588 = score(doc=4651,freq=2.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.38790947 = fieldWeight in 4651, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.15625 = fieldNorm(doc=4651)
0.04809258 = product of:
0.09618516 = sum of:
0.09618516 = weight(_text_:retrieval in 4651) [ClassicSimilarity], result of:
0.09618516 = score(doc=4651,freq=2.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.6684181 = fieldWeight in 4651, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.15625 = fieldNorm(doc=4651)
0.5 = coord(1/2)
0.5 = coord(2/4)
-
Sparck Jones, K.: Automatic keyword classification for information retrieval (1971)
0.04
0.040243585 = product of:
0.08048717 = sum of:
0.032394588 = weight(_text_:information in 5176) [ClassicSimilarity], result of:
0.032394588 = score(doc=5176,freq=2.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.38790947 = fieldWeight in 5176, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.15625 = fieldNorm(doc=5176)
0.04809258 = product of:
0.09618516 = sum of:
0.09618516 = weight(_text_:retrieval in 5176) [ClassicSimilarity], result of:
0.09618516 = score(doc=5176,freq=2.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.6684181 = fieldWeight in 5176, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.15625 = fieldNorm(doc=5176)
0.5 = coord(1/2)
0.5 = coord(2/4)
-
Dattola, R.T.: FIRST: Flexible information retrieval system for text (1979)
0.04
0.037562177 = product of:
0.07512435 = sum of:
0.036650293 = weight(_text_:information in 5172) [ClassicSimilarity], result of:
0.036650293 = score(doc=5172,freq=4.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.43886948 = fieldWeight in 5172, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.125 = fieldNorm(doc=5172)
0.038474064 = product of:
0.07694813 = sum of:
0.07694813 = weight(_text_:retrieval in 5172) [ClassicSimilarity], result of:
0.07694813 = score(doc=5172,freq=2.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.5347345 = fieldWeight in 5172, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.125 = fieldNorm(doc=5172)
0.5 = coord(1/2)
0.5 = coord(2/4)
- Source
- Journal of the American Society for Information Science. 30(1979), S.9-14
-
Garfield, E.; Sager, N.: Mechanical indexing, structural linguistics and information retrieval (1993)
0.04
0.037562177 = product of:
0.07512435 = sum of:
0.036650293 = weight(_text_:information in 5900) [ClassicSimilarity], result of:
0.036650293 = score(doc=5900,freq=4.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.43886948 = fieldWeight in 5900, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.125 = fieldNorm(doc=5900)
0.038474064 = product of:
0.07694813 = sum of:
0.07694813 = weight(_text_:retrieval in 5900) [ClassicSimilarity], result of:
0.07694813 = score(doc=5900,freq=2.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.5347345 = fieldWeight in 5900, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.125 = fieldNorm(doc=5900)
0.5 = coord(1/2)
0.5 = coord(2/4)
- Source
- Journal of information science. 19(1993) no.2, S.164-165
-
Milstead, J.L.: Thesauri in a full-text world (1998)
0.04
0.03623499 = product of:
0.07246998 = sum of:
0.016197294 = weight(_text_:information in 2337) [ClassicSimilarity], result of:
0.016197294 = score(doc=2337,freq=8.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.19395474 = fieldWeight in 2337, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0390625 = fieldNorm(doc=2337)
0.05627269 = sum of:
0.02404629 = weight(_text_:retrieval in 2337) [ClassicSimilarity], result of:
0.02404629 = score(doc=2337,freq=2.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.16710453 = fieldWeight in 2337, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0390625 = fieldNorm(doc=2337)
0.0322264 = weight(_text_:22 in 2337) [ClassicSimilarity], result of:
0.0322264 = score(doc=2337,freq=2.0), product of:
0.16658723 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.047571484 = queryNorm
0.19345059 = fieldWeight in 2337, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0390625 = fieldNorm(doc=2337)
0.5 = coord(2/4)
- Abstract
- Despite early claims to the contrary, thesauri continue to find use as access tools for information in the full-text environment. Their mode of use is changing, but this change actually represents an expansion rather than a contraction of their utility. Thesauri and similar vocabulary tools can complement full-text access by aiding users in focusing their searches, by supplementing the linguistic analysis of the text search engine, and even by serving as one of the tools used by the linguistic engine for its analysis. While human indexing continues to be used for many databases, the trend is to increase the use of machine aids for this purpose. All machine-aided indexing (MAI) systems rely on thesauri as the basis for term selection. In the 21st century, the balance of effort between human and machine will change at both input and output, but thesauri will continue to play an important role for the foreseeable future.
- Date
- 22. 9.1997 19:16:05
- Imprint
- Urbana-Champaign, IL : Illinois University at Urbana-Champaign, Graduate School of Library and Information Science
- Source
- Visualizing subject access for 21st century information resources: Papers presented at the 1997 Clinic on Library Applications of Data Processing, 2-4 Mar 1997, Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign. Ed.: P.A. Cochrane et al
- Theme
- Verbal documentation languages in online retrieval
-
Salton, G.: Another look at automatic text-retrieval systems (1986)
0.03
0.034983218 = product of:
0.069966435 = sum of:
0.016197294 = weight(_text_:information in 1356) [ClassicSimilarity], result of:
0.016197294 = score(doc=1356,freq=2.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.19395474 = fieldWeight in 1356, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.078125 = fieldNorm(doc=1356)
0.053769138 = product of:
0.107538275 = sum of:
0.107538275 = weight(_text_:retrieval in 1356) [ClassicSimilarity], result of:
0.107538275 = score(doc=1356,freq=10.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.74731416 = fieldWeight in 1356, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.078125 = fieldNorm(doc=1356)
0.5 = coord(1/2)
0.5 = coord(2/4)
- Footnote
- Refers to: Blair, D.C.: An evaluation of retrieval effectiveness for a full-text document-retrieval system. Comm. ACM 28(1985) S.280-299. - See also: Blair, D.C.: Full text retrieval ... Int. Class. 13(1986) S.18-23; Blair, D.C., M.E. Maron: Full-text information retrieval ... Inf. Proc. Man. 26(1990) S.437-447.
-
Salton, G.; McGill, M. J.: Information Retrieval: Grundlegendes für Informationswissenschaftler (1987)
0.03
0.03485197 = product of:
0.06970394 = sum of:
0.02805454 = weight(_text_:information in 8648) [ClassicSimilarity], result of:
0.02805454 = score(doc=8648,freq=6.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.3359395 = fieldWeight in 8648, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.078125 = fieldNorm(doc=8648)
0.041649397 = product of:
0.083298795 = sum of:
0.083298795 = weight(_text_:retrieval in 8648) [ClassicSimilarity], result of:
0.083298795 = score(doc=8648,freq=6.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.5788671 = fieldWeight in 8648, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.078125 = fieldNorm(doc=8648)
0.5 = coord(1/2)
0.5 = coord(2/4)
- Content
- Contains the chapters: Information retrieval: an introduction; Inverted file systems; Text analysis and automatic indexing; The experimental retrieval systems SMART and SIRE; The evaluation of retrieval systems; Advanced retrieval techniques; Natural language processing; Information technology: hardware and software; Database management systems; Future developments in information retrieval
-
Salton, G.: Automatic text processing : the transformation, analysis, and retrieval of information by computer (1989)
0.03
0.03485197 = product of:
0.06970394 = sum of:
0.02805454 = weight(_text_:information in 1307) [ClassicSimilarity], result of:
0.02805454 = score(doc=1307,freq=6.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.3359395 = fieldWeight in 1307, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.078125 = fieldNorm(doc=1307)
0.041649397 = product of:
0.083298795 = sum of:
0.083298795 = weight(_text_:retrieval in 1307) [ClassicSimilarity], result of:
0.083298795 = score(doc=1307,freq=6.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.5788671 = fieldWeight in 1307, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.078125 = fieldNorm(doc=1307)
0.5 = coord(1/2)
0.5 = coord(2/4)
- COMPASS
- Information retrieval / Use of / On-line computers
- Subject
- Information retrieval / Use of / On-line computers
-
Hlava, M.M.K.: Automatic indexing : comparing rule-based and statistics-based indexing systems (2005)
0.03
0.033896584 = product of:
0.06779317 = sum of:
0.022676213 = weight(_text_:information in 6265) [ClassicSimilarity], result of:
0.022676213 = score(doc=6265,freq=2.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.27153665 = fieldWeight in 6265, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.109375 = fieldNorm(doc=6265)
0.045116954 = product of:
0.09023391 = sum of:
0.09023391 = weight(_text_:22 in 6265) [ClassicSimilarity], result of:
0.09023391 = score(doc=6265,freq=2.0), product of:
0.16658723 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.047571484 = queryNorm
0.5416616 = fieldWeight in 6265, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.109375 = fieldNorm(doc=6265)
0.5 = coord(1/2)
0.5 = coord(2/4)
- Source
- Information outlook. 9(2005) no.8, S.22-23
-
Plaunt, C.; Norgard, B.A.: An association-based method for automatic indexing with a controlled vocabulary (1998)
0.03
0.033862952 = product of:
0.067725904 = sum of:
0.011453216 = weight(_text_:information in 1794) [ClassicSimilarity], result of:
0.011453216 = score(doc=1794,freq=4.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.13714671 = fieldWeight in 1794, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0390625 = fieldNorm(doc=1794)
0.05627269 = sum of:
0.02404629 = weight(_text_:retrieval in 1794) [ClassicSimilarity], result of:
0.02404629 = score(doc=1794,freq=2.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.16710453 = fieldWeight in 1794, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0390625 = fieldNorm(doc=1794)
0.0322264 = weight(_text_:22 in 1794) [ClassicSimilarity], result of:
0.0322264 = score(doc=1794,freq=2.0), product of:
0.16658723 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.047571484 = queryNorm
0.19345059 = fieldWeight in 1794, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0390625 = fieldNorm(doc=1794)
0.5 = coord(2/4)
- Abstract
- In this article, we describe and test a two-stage algorithm based on a lexical collocation technique which maps from the lexical clues contained in a document representation into a controlled vocabulary list of subject headings. Using a collection of 4,626 INSPEC documents, we create a 'dictionary' of associations between the lexical items contained in the titles, authors, and abstracts, and the controlled vocabulary subject headings assigned to those records by human indexers, using a likelihood ratio statistic as the measure of association. In the deployment stage, we use the dictionary to predict which of the controlled vocabulary subject headings best describe new documents when they are presented to the system. Our evaluation of this algorithm, in which we compare the automatically assigned subject headings to the subject headings assigned to the test documents by human catalogers, shows that we can obtain results comparable to, and consistent with, human cataloging. In effect we have cast this as a classic partial match information retrieval problem. We consider the problem to be one of 'retrieving' (or assigning) the most probably 'relevant' (or correct) controlled vocabulary subject headings to a document based on the clues contained in that document.
- Date
- 11. 9.2000 19:53:22
- Source
- Journal of the American Society for Information Science. 49(1998) no.10, S.888-902
-
Griffiths, A.; Luckhurst, H.C.; Willett, P.: Using interdocument similarity information in document retrieval systems (1986)
0.03
0.032866906 = product of:
0.06573381 = sum of:
0.032069005 = weight(_text_:information in 2415) [ClassicSimilarity], result of:
0.032069005 = score(doc=2415,freq=4.0), product of:
0.08351069 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.047571484 = queryNorm
0.3840108 = fieldWeight in 2415, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.109375 = fieldNorm(doc=2415)
0.033664808 = product of:
0.067329615 = sum of:
0.067329615 = weight(_text_:retrieval in 2415) [ClassicSimilarity], result of:
0.067329615 = score(doc=2415,freq=2.0), product of:
0.1438997 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.047571484 = queryNorm
0.46789268 = fieldWeight in 2415, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.109375 = fieldNorm(doc=2415)
0.5 = coord(1/2)
0.5 = coord(2/4)
- Source
- Journal of the American Society for Information Science. 37(1986) no.1, S.3-11