# Document (#2323)

Author
Panyr, J.
Title
Vektorraum-Modell und Clusteranalyse in Information-Retrieval-Systemen
Source
Nachrichten für Dokumentation. 38(1987) H.1, S.13-20
Year
1987
Abstract
Ausgehend von theoretischen Indexierungsansätzen wird das klassische Vektorraum-Modell für automatische Indexierung (mit dem Trennschärfen-Modell) erläutert. Das Clustering in Information-Retrieval-Systemem wird als eine natürliche logische Folge aus diesem Modell aufgefaßt und in allen seinen Ausprägungen (d.h. als Dokumenten-, Term- oder Dokumenten- und Termklassifikation) behandelt. Anschließend werden die Suchstrategien in vorklassifizierten Dokumentenbeständen (Clustersuche) detailliert beschrieben. Zum Schluß wird noch die sinnvolle Anwendung der Clusteranalyse in Information-Retrieval-Systemen kurz diskutiert
Theme
Automatisches Indexieren
Automatisches Klassifizieren

## Similar documents (author)

1. Panyr, J.: Thesaurus und wissensbasierte Systeme - Thesauri und Wissensbasen (1988) 5.46
```5.456504 = sum of:
5.456504 = weight(author_txt:panyr in 22) [ClassicSimilarity], result of:
5.456504 = fieldWeight in 22, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
8.730406 = idf(docFreq=18, maxDocs=43254)
0.625 = fieldNorm(doc=22)
```
2. Panyr, J.: Information-Retrieval-Methoden in regelbasierten Expertensystemen (1990) 5.46
```5.456504 = sum of:
5.456504 = weight(author_txt:panyr in 260) [ClassicSimilarity], result of:
5.456504 = fieldWeight in 260, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
8.730406 = idf(docFreq=18, maxDocs=43254)
0.625 = fieldNorm(doc=260)
```
3. Panyr, J.: Vom Wissen zur Information : Notwendigkeit der Kooperation der Fachleute aus dem Bereich der Informations-Retrieval-Systeme und der Systeme mit formaler Intelligenz (1988) 5.46
```5.456504 = sum of:
5.456504 = weight(author_txt:panyr in 768) [ClassicSimilarity], result of:
5.456504 = fieldWeight in 768, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
8.730406 = idf(docFreq=18, maxDocs=43254)
0.625 = fieldNorm(doc=768)
```
4. Panyr, J.: ¬Die Theorie der Fuzzy-Mengen und Information-Retrieval-Systeme (1986) 5.46
```5.456504 = sum of:
5.456504 = weight(author_txt:panyr in 788) [ClassicSimilarity], result of:
5.456504 = fieldWeight in 788, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
8.730406 = idf(docFreq=18, maxDocs=43254)
0.625 = fieldNorm(doc=788)
```
5. Panyr, J.: Probabilistische Modelle in Information-Retrieval-Systemen (1986) 5.46
```5.456504 = sum of:
5.456504 = weight(author_txt:panyr in 1460) [ClassicSimilarity], result of:
5.456504 = fieldWeight in 1460, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
8.730406 = idf(docFreq=18, maxDocs=43254)
0.625 = fieldNorm(doc=1460)
```

## Similar documents (content)

1. Kaiser, A.: Computer-unterstütztes Indexieren in Intelligenten Information Retrieval Systemen : Ein Relevanz-Feedback orientierter Ansatz zur Informationserschließung in unformatierten Datenbanken (1993) 0.19
```0.18881702 = sum of:
0.18881702 = product of:
0.5244917 = sum of:
0.022143634 = weight(abstract_txt:behandelt in 749) [ClassicSimilarity], result of:
0.022143634 = score(doc=749,freq=1.0), product of:
0.11385256 = queryWeight, product of:
6.2238064 = idf(docFreq=232, maxDocs=43254)
0.018293075 = queryNorm
0.19449395 = fieldWeight in 749, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
6.2238064 = idf(docFreq=232, maxDocs=43254)
0.03125 = fieldNorm(doc=749)
0.069343574 = weight(abstract_txt:indexierung in 749) [ClassicSimilarity], result of:
0.069343574 = score(doc=749,freq=6.0), product of:
0.13411085 = queryWeight, product of:
1.0853269 = boost
6.754864 = idf(docFreq=136, maxDocs=43254)
0.018293075 = queryNorm
0.5170616 = fieldWeight in 749, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
6.754864 = idf(docFreq=136, maxDocs=43254)
0.03125 = fieldNorm(doc=749)
0.03080054 = weight(abstract_txt:automatische in 749) [ClassicSimilarity], result of:
0.03080054 = score(doc=749,freq=1.0), product of:
0.14186734 = queryWeight, product of:
1.1162715 = boost
6.9474573 = idf(docFreq=112, maxDocs=43254)
0.018293075 = queryNorm
0.21710804 = fieldWeight in 749, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
6.9474573 = idf(docFreq=112, maxDocs=43254)
0.03125 = fieldNorm(doc=749)
0.014201688 = weight(abstract_txt:information in 749) [ClassicSimilarity], result of:
0.014201688 = score(doc=749,freq=13.0), product of:
0.05193532 = queryWeight, product of:
1.1698242 = boost
2.42692 = idf(docFreq=10382, maxDocs=43254)
0.018293075 = queryNorm
0.2734495 = fieldWeight in 749, product of:
3.6055512 = tf(freq=13.0), with freq of:
13.0 = termFreq=13.0
2.42692 = idf(docFreq=10382, maxDocs=43254)
0.03125 = fieldNorm(doc=749)
0.032561 = weight(abstract_txt:retrieval in 749) [ClassicSimilarity], result of:
0.032561 = score(doc=749,freq=8.0), product of:
0.10616608 = queryWeight, product of:
1.6725615 = boost
3.4699 = idf(docFreq=3658, maxDocs=43254)
0.018293075 = queryNorm
0.3066987 = fieldWeight in 749, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
3.4699 = idf(docFreq=3658, maxDocs=43254)
0.03125 = fieldNorm(doc=749)
0.033550333 = weight(abstract_txt:wird in 749) [ClassicSimilarity], result of:
0.033550333 = score(doc=749,freq=5.0), product of:
0.12667528 = queryWeight, product of:
1.8269858 = boost
3.7902684 = idf(docFreq=2655, maxDocs=43254)
0.018293075 = queryNorm
0.26485306 = fieldWeight in 749, product of:
2.236068 = tf(freq=5.0), with freq of:
5.0 = termFreq=5.0
3.7902684 = idf(docFreq=2655, maxDocs=43254)
0.03125 = fieldNorm(doc=749)
0.120693326 = weight(abstract_txt:systemen in 749) [ClassicSimilarity], result of:
0.120693326 = score(doc=749,freq=6.0), product of:
0.2444886 = queryWeight, product of:
2.0723968 = boost
6.449098 = idf(docFreq=185, maxDocs=43254)
0.018293075 = queryNorm
0.49365625 = fieldWeight in 749, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
6.449098 = idf(docFreq=185, maxDocs=43254)
0.03125 = fieldNorm(doc=749)
0.09929298 = weight(abstract_txt:dokumenten in 749) [ClassicSimilarity], result of:
0.09929298 = score(doc=749,freq=4.0), product of:
0.24572305 = queryWeight, product of:
2.0776222 = boost
6.4653587 = idf(docFreq=182, maxDocs=43254)
0.018293075 = queryNorm
0.40408492 = fieldWeight in 749, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
6.4653587 = idf(docFreq=182, maxDocs=43254)
0.03125 = fieldNorm(doc=749)
0.10190461 = weight(abstract_txt:modell in 749) [ClassicSimilarity], result of:
0.10190461 = score(doc=749,freq=1.0), product of:
0.50002617 = queryWeight, product of:
4.19136 = boost
6.5215535 = idf(docFreq=172, maxDocs=43254)
0.018293075 = queryNorm
0.20379855 = fieldWeight in 749, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
6.5215535 = idf(docFreq=172, maxDocs=43254)
0.03125 = fieldNorm(doc=749)
0.36 = coord(9/25)
```
2. Panyr, J.: Probabilistische Modelle in Information-Retrieval-Systemen (1986) 0.18
```0.18499948 = sum of:
0.18499948 = product of:
0.77083117 = sum of:
0.09163116 = weight(abstract_txt:kurz in 1460) [ClassicSimilarity], result of:
0.09163116 = score(doc=1460,freq=1.0), product of:
0.12729958 = queryWeight, product of:
1.0574068 = boost
6.5810947 = idf(docFreq=162, maxDocs=43254)
0.018293075 = queryNorm
0.71980727 = fieldWeight in 1460, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
6.5810947 = idf(docFreq=162, maxDocs=43254)
0.109375 = fieldNorm(doc=1460)
0.09908288 = weight(abstract_txt:indexierung in 1460) [ClassicSimilarity], result of:
0.09908288 = score(doc=1460,freq=1.0), product of:
0.13411085 = queryWeight, product of:
1.0853269 = boost
6.754864 = idf(docFreq=136, maxDocs=43254)
0.018293075 = queryNorm
0.7388133 = fieldWeight in 1460, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
6.754864 = idf(docFreq=136, maxDocs=43254)
0.109375 = fieldNorm(doc=1460)
0.10210323 = weight(abstract_txt:ausgehend in 1460) [ClassicSimilarity], result of:
0.10210323 = score(doc=1460,freq=1.0), product of:
0.1368226 = queryWeight, product of:
1.0962447 = boost
6.822815 = idf(docFreq=127, maxDocs=43254)
0.018293075 = queryNorm
0.7462454 = fieldWeight in 1460, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
6.822815 = idf(docFreq=127, maxDocs=43254)
0.109375 = fieldNorm(doc=1460)
0.15528427 = weight(abstract_txt:detailliert in 1460) [ClassicSimilarity], result of:
0.15528427 = score(doc=1460,freq=1.0), product of:
0.18094635 = queryWeight, product of:
1.260676 = boost
7.846204 = idf(docFreq=45, maxDocs=43254)
0.018293075 = queryNorm
0.85817856 = fieldWeight in 1460, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
7.846204 = idf(docFreq=45, maxDocs=43254)
0.109375 = fieldNorm(doc=1460)
0.2317717 = weight(abstract_txt:schluß in 1460) [ClassicSimilarity], result of:
0.2317717 = score(doc=1460,freq=1.0), product of:
0.23632252 = queryWeight, product of:
1.4407252 = boost
8.966795 = idf(docFreq=14, maxDocs=43254)
0.018293075 = queryNorm
0.98074317 = fieldWeight in 1460, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
8.966795 = idf(docFreq=14, maxDocs=43254)
0.109375 = fieldNorm(doc=1460)
0.090957925 = weight(abstract_txt:wird in 1460) [ClassicSimilarity], result of:
0.090957925 = score(doc=1460,freq=3.0), product of:
0.12667528 = queryWeight, product of:
1.8269858 = boost
3.7902684 = idf(docFreq=2655, maxDocs=43254)
0.018293075 = queryNorm
0.71804005 = fieldWeight in 1460, product of:
1.7320508 = tf(freq=3.0), with freq of:
3.0 = termFreq=3.0
3.7902684 = idf(docFreq=2655, maxDocs=43254)
0.109375 = fieldNorm(doc=1460)
0.24 = coord(6/25)
```
3. Fuhr, N.: Theorie des Information Retrieval I : Modelle (2004) 0.15
```0.14621846 = sum of:
0.14621846 = product of:
0.60924363 = sum of:
0.056618787 = weight(abstract_txt:indexierung in 4913) [ClassicSimilarity], result of:
0.056618787 = score(doc=4913,freq=1.0), product of:
0.13411085 = queryWeight, product of:
1.0853269 = boost
6.754864 = idf(docFreq=136, maxDocs=43254)
0.018293075 = queryNorm
0.422179 = fieldWeight in 4913, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
6.754864 = idf(docFreq=136, maxDocs=43254)
0.0625 = fieldNorm(doc=4913)
0.007877679 = weight(abstract_txt:information in 4913) [ClassicSimilarity], result of:
0.007877679 = score(doc=4913,freq=1.0), product of:
0.05193532 = queryWeight, product of:
1.1698242 = boost
2.42692 = idf(docFreq=10382, maxDocs=43254)
0.018293075 = queryNorm
0.1516825 = fieldWeight in 4913, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
2.42692 = idf(docFreq=10382, maxDocs=43254)
0.0625 = fieldNorm(doc=4913)
0.032561 = weight(abstract_txt:retrieval in 4913) [ClassicSimilarity], result of:
0.032561 = score(doc=4913,freq=2.0), product of:
0.10616608 = queryWeight, product of:
1.6725615 = boost
3.4699 = idf(docFreq=3658, maxDocs=43254)
0.018293075 = queryNorm
0.3066987 = fieldWeight in 4913, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.4699 = idf(docFreq=3658, maxDocs=43254)
0.0625 = fieldNorm(doc=4913)
0.051975954 = weight(abstract_txt:wird in 4913) [ClassicSimilarity], result of:
0.051975954 = score(doc=4913,freq=3.0), product of:
0.12667528 = queryWeight, product of:
1.8269858 = boost
3.7902684 = idf(docFreq=2655, maxDocs=43254)
0.018293075 = queryNorm
0.4103086 = fieldWeight in 4913, product of:
1.7320508 = tf(freq=3.0), with freq of:
3.0 = termFreq=3.0
3.7902684 = idf(docFreq=2655, maxDocs=43254)
0.0625 = fieldNorm(doc=4913)
0.17198049 = weight(abstract_txt:dokumenten in 4913) [ClassicSimilarity], result of:
0.17198049 = score(doc=4913,freq=3.0), product of:
0.24572305 = queryWeight, product of:
2.0776222 = boost
6.4653587 = idf(docFreq=182, maxDocs=43254)
0.018293075 = queryNorm
0.6998956 = fieldWeight in 4913, product of:
1.7320508 = tf(freq=3.0), with freq of:
3.0 = termFreq=3.0
6.4653587 = idf(docFreq=182, maxDocs=43254)
0.0625 = fieldNorm(doc=4913)
0.28822973 = weight(abstract_txt:modell in 4913) [ClassicSimilarity], result of:
0.28822973 = score(doc=4913,freq=2.0), product of:
0.50002617 = queryWeight, product of:
4.19136 = boost
6.5215535 = idf(docFreq=172, maxDocs=43254)
0.018293075 = queryNorm
0.5764293 = fieldWeight in 4913, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
6.5215535 = idf(docFreq=172, maxDocs=43254)
0.0625 = fieldNorm(doc=4913)
0.24 = coord(6/25)
```
4. Siebenlist, T.: MEMOSE. Spezialsuchmaschine für emotional geladene Dokumente (2012) 0.13
```0.12650649 = sum of:
0.12650649 = product of:
0.5271104 = sum of:
0.07854099 = weight(abstract_txt:kurz in 1640) [ClassicSimilarity], result of:
0.07854099 = score(doc=1640,freq=1.0), product of:
0.12729958 = queryWeight, product of:
1.0574068 = boost
6.5810947 = idf(docFreq=162, maxDocs=43254)
0.018293075 = queryNorm
0.61697763 = fieldWeight in 1640, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
6.5810947 = idf(docFreq=162, maxDocs=43254)
0.09375 = fieldNorm(doc=1640)
0.084928185 = weight(abstract_txt:indexierung in 1640) [ClassicSimilarity], result of:
0.084928185 = score(doc=1640,freq=1.0), product of:
0.13411085 = queryWeight, product of:
1.0853269 = boost
6.754864 = idf(docFreq=136, maxDocs=43254)
0.018293075 = queryNorm
0.63326854 = fieldWeight in 1640, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
6.754864 = idf(docFreq=136, maxDocs=43254)
0.09375 = fieldNorm(doc=1640)
0.011816518 = weight(abstract_txt:information in 1640) [ClassicSimilarity], result of:
0.011816518 = score(doc=1640,freq=1.0), product of:
0.05193532 = queryWeight, product of:
1.1698242 = boost
2.42692 = idf(docFreq=10382, maxDocs=43254)
0.018293075 = queryNorm
0.22752374 = fieldWeight in 1640, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
2.42692 = idf(docFreq=10382, maxDocs=43254)
0.09375 = fieldNorm(doc=1640)
0.048841503 = weight(abstract_txt:retrieval in 1640) [ClassicSimilarity], result of:
0.048841503 = score(doc=1640,freq=2.0), product of:
0.10616608 = queryWeight, product of:
1.6725615 = boost
3.4699 = idf(docFreq=3658, maxDocs=43254)
0.018293075 = queryNorm
0.46004808 = fieldWeight in 1640, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.4699 = idf(docFreq=3658, maxDocs=43254)
0.09375 = fieldNorm(doc=1640)
0.0450125 = weight(abstract_txt:wird in 1640) [ClassicSimilarity], result of:
0.0450125 = score(doc=1640,freq=1.0), product of:
0.12667528 = queryWeight, product of:
1.8269858 = boost
3.7902684 = idf(docFreq=2655, maxDocs=43254)
0.018293075 = queryNorm
0.35533768 = fieldWeight in 1640, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
3.7902684 = idf(docFreq=2655, maxDocs=43254)
0.09375 = fieldNorm(doc=1640)
0.25797072 = weight(abstract_txt:dokumenten in 1640) [ClassicSimilarity], result of:
0.25797072 = score(doc=1640,freq=3.0), product of:
0.24572305 = queryWeight, product of:
2.0776222 = boost
6.4653587 = idf(docFreq=182, maxDocs=43254)
0.018293075 = queryNorm
1.0498434 = fieldWeight in 1640, product of:
1.7320508 = tf(freq=3.0), with freq of:
3.0 = termFreq=3.0
6.4653587 = idf(docFreq=182, maxDocs=43254)
0.09375 = fieldNorm(doc=1640)
0.24 = coord(6/25)
```
5. Markscheffel, B.: ¬Eine Entwurfsmethodik für Hypermedia-Systeme auf Basis des Spatial-Satellite-Modells S**2M (1993) 0.12
```0.1228141 = sum of:
0.1228141 = product of:
0.76758814 = sum of:
0.1237678 = weight(abstract_txt:ausgehend in 3709) [ClassicSimilarity], result of:
0.1237678 = score(doc=3709,freq=2.0), product of:
0.1368226 = queryWeight, product of:
1.0962447 = boost
6.822815 = idf(docFreq=127, maxDocs=43254)
0.018293075 = queryNorm
0.90458596 = fieldWeight in 3709, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
6.822815 = idf(docFreq=127, maxDocs=43254)
0.09375 = fieldNorm(doc=3709)
0.063657284 = weight(abstract_txt:wird in 3709) [ClassicSimilarity], result of:
0.063657284 = score(doc=3709,freq=2.0), product of:
0.12667528 = queryWeight, product of:
1.8269858 = boost
3.7902684 = idf(docFreq=2655, maxDocs=43254)
0.018293075 = queryNorm
0.50252336 = fieldWeight in 3709, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.7902684 = idf(docFreq=2655, maxDocs=43254)
0.09375 = fieldNorm(doc=3709)
0.14781852 = weight(abstract_txt:systemen in 3709) [ClassicSimilarity], result of:
0.14781852 = score(doc=3709,freq=1.0), product of:
0.2444886 = queryWeight, product of:
2.0723968 = boost
6.449098 = idf(docFreq=185, maxDocs=43254)
0.018293075 = queryNorm
0.60460293 = fieldWeight in 3709, product of:
1.0 = tf(freq=1.0), with freq of:
1.0 = termFreq=1.0
6.449098 = idf(docFreq=185, maxDocs=43254)
0.09375 = fieldNorm(doc=3709)
0.4323446 = weight(abstract_txt:modell in 3709) [ClassicSimilarity], result of:
0.4323446 = score(doc=3709,freq=2.0), product of:
0.50002617 = queryWeight, product of:
4.19136 = boost
6.5215535 = idf(docFreq=172, maxDocs=43254)
0.018293075 = queryNorm
0.86464393 = fieldWeight in 3709, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
6.5215535 = idf(docFreq=172, maxDocs=43254)
0.09375 = fieldNorm(doc=3709)
0.16 = coord(4/25)
```