Document (#2323)

Author
Panyr, J.
Title
Vektorraum-Modell und Clusteranalyse in Information-Retrieval-Systemen
Source
Nachrichten für Dokumentation. 38(1987) H.1, S.13-20
Year
1987
Abstract
Ausgehend von theoretischen Indexierungsansätzen wird das klassische Vektorraum-Modell für automatische Indexierung (mit dem Trennschärfen-Modell) erläutert. Das Clustering in Information-Retrieval-Systemem wird als eine natürliche logische Folge aus diesem Modell aufgefaßt und in allen seinen Ausprägungen (d.h. als Dokumenten-, Term- oder Dokumenten- und Termklassifikation) behandelt. Anschließend werden die Suchstrategien in vorklassifizierten Dokumentenbeständen (Clustersuche) detailliert beschrieben. Zum Schluß wird noch die sinnvolle Anwendung der Clusteranalyse in Information-Retrieval-Systemen kurz diskutiert
Theme
Automatisches Indexieren
Automatisches Klassifizieren

Similar documents (author)

  1. Panyr, J.: Thesaurus und wissensbasierte Systeme - Thesauri und Wissensbasen (1988) 5.46
    5.456504 = sum of:
      5.456504 = weight(author_txt:panyr in 22) [ClassicSimilarity], result of:
        5.456504 = fieldWeight in 22, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.730406 = idf(docFreq=18, maxDocs=43254)
          0.625 = fieldNorm(doc=22)
    
  2. Panyr, J.: Information-Retrieval-Methoden in regelbasierten Expertensystemen (1990) 5.46
    5.456504 = sum of:
      5.456504 = weight(author_txt:panyr in 260) [ClassicSimilarity], result of:
        5.456504 = fieldWeight in 260, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.730406 = idf(docFreq=18, maxDocs=43254)
          0.625 = fieldNorm(doc=260)
    
  3. Panyr, J.: Vom Wissen zur Information : Notwendigkeit der Kooperation der Fachleute aus dem Bereich der Informations-Retrieval-Systeme und der Systeme mit formaler Intelligenz (1988) 5.46
    5.456504 = sum of:
      5.456504 = weight(author_txt:panyr in 768) [ClassicSimilarity], result of:
        5.456504 = fieldWeight in 768, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.730406 = idf(docFreq=18, maxDocs=43254)
          0.625 = fieldNorm(doc=768)
    
  4. Panyr, J.: ¬Die Theorie der Fuzzy-Mengen und Information-Retrieval-Systeme (1986) 5.46
    5.456504 = sum of:
      5.456504 = weight(author_txt:panyr in 788) [ClassicSimilarity], result of:
        5.456504 = fieldWeight in 788, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.730406 = idf(docFreq=18, maxDocs=43254)
          0.625 = fieldNorm(doc=788)
    
  5. Panyr, J.: Probabilistische Modelle in Information-Retrieval-Systemen (1986) 5.46
    5.456504 = sum of:
      5.456504 = weight(author_txt:panyr in 1460) [ClassicSimilarity], result of:
        5.456504 = fieldWeight in 1460, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.730406 = idf(docFreq=18, maxDocs=43254)
          0.625 = fieldNorm(doc=1460)
    

Similar documents (content)

  1. Kaiser, A.: Computer-unterstütztes Indexieren in Intelligenten Information Retrieval Systemen : Ein Relevanz-Feedback orientierter Ansatz zur Informationserschließung in unformatierten Datenbanken (1993) 0.19
    0.18881702 = sum of:
      0.18881702 = product of:
        0.5244917 = sum of:
          0.022143634 = weight(abstract_txt:behandelt in 749) [ClassicSimilarity], result of:
            0.022143634 = score(doc=749,freq=1.0), product of:
              0.11385256 = queryWeight, product of:
                6.2238064 = idf(docFreq=232, maxDocs=43254)
                0.018293075 = queryNorm
              0.19449395 = fieldWeight in 749, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2238064 = idf(docFreq=232, maxDocs=43254)
                0.03125 = fieldNorm(doc=749)
          0.069343574 = weight(abstract_txt:indexierung in 749) [ClassicSimilarity], result of:
            0.069343574 = score(doc=749,freq=6.0), product of:
              0.13411085 = queryWeight, product of:
                1.0853269 = boost
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.018293075 = queryNorm
              0.5170616 = fieldWeight in 749, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.03125 = fieldNorm(doc=749)
          0.03080054 = weight(abstract_txt:automatische in 749) [ClassicSimilarity], result of:
            0.03080054 = score(doc=749,freq=1.0), product of:
              0.14186734 = queryWeight, product of:
                1.1162715 = boost
                6.9474573 = idf(docFreq=112, maxDocs=43254)
                0.018293075 = queryNorm
              0.21710804 = fieldWeight in 749, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9474573 = idf(docFreq=112, maxDocs=43254)
                0.03125 = fieldNorm(doc=749)
          0.014201688 = weight(abstract_txt:information in 749) [ClassicSimilarity], result of:
            0.014201688 = score(doc=749,freq=13.0), product of:
              0.05193532 = queryWeight, product of:
                1.1698242 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.018293075 = queryNorm
              0.2734495 = fieldWeight in 749, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.03125 = fieldNorm(doc=749)
          0.032561 = weight(abstract_txt:retrieval in 749) [ClassicSimilarity], result of:
            0.032561 = score(doc=749,freq=8.0), product of:
              0.10616608 = queryWeight, product of:
                1.6725615 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.018293075 = queryNorm
              0.3066987 = fieldWeight in 749, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.03125 = fieldNorm(doc=749)
          0.033550333 = weight(abstract_txt:wird in 749) [ClassicSimilarity], result of:
            0.033550333 = score(doc=749,freq=5.0), product of:
              0.12667528 = queryWeight, product of:
                1.8269858 = boost
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.018293075 = queryNorm
              0.26485306 = fieldWeight in 749, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.03125 = fieldNorm(doc=749)
          0.120693326 = weight(abstract_txt:systemen in 749) [ClassicSimilarity], result of:
            0.120693326 = score(doc=749,freq=6.0), product of:
              0.2444886 = queryWeight, product of:
                2.0723968 = boost
                6.449098 = idf(docFreq=185, maxDocs=43254)
                0.018293075 = queryNorm
              0.49365625 = fieldWeight in 749, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.449098 = idf(docFreq=185, maxDocs=43254)
                0.03125 = fieldNorm(doc=749)
          0.09929298 = weight(abstract_txt:dokumenten in 749) [ClassicSimilarity], result of:
            0.09929298 = score(doc=749,freq=4.0), product of:
              0.24572305 = queryWeight, product of:
                2.0776222 = boost
                6.4653587 = idf(docFreq=182, maxDocs=43254)
                0.018293075 = queryNorm
              0.40408492 = fieldWeight in 749, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.4653587 = idf(docFreq=182, maxDocs=43254)
                0.03125 = fieldNorm(doc=749)
          0.10190461 = weight(abstract_txt:modell in 749) [ClassicSimilarity], result of:
            0.10190461 = score(doc=749,freq=1.0), product of:
              0.50002617 = queryWeight, product of:
                4.19136 = boost
                6.5215535 = idf(docFreq=172, maxDocs=43254)
                0.018293075 = queryNorm
              0.20379855 = fieldWeight in 749, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5215535 = idf(docFreq=172, maxDocs=43254)
                0.03125 = fieldNorm(doc=749)
        0.36 = coord(9/25)
    
  2. Panyr, J.: Probabilistische Modelle in Information-Retrieval-Systemen (1986) 0.18
    0.18499948 = sum of:
      0.18499948 = product of:
        0.77083117 = sum of:
          0.09163116 = weight(abstract_txt:kurz in 1460) [ClassicSimilarity], result of:
            0.09163116 = score(doc=1460,freq=1.0), product of:
              0.12729958 = queryWeight, product of:
                1.0574068 = boost
                6.5810947 = idf(docFreq=162, maxDocs=43254)
                0.018293075 = queryNorm
              0.71980727 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5810947 = idf(docFreq=162, maxDocs=43254)
                0.109375 = fieldNorm(doc=1460)
          0.09908288 = weight(abstract_txt:indexierung in 1460) [ClassicSimilarity], result of:
            0.09908288 = score(doc=1460,freq=1.0), product of:
              0.13411085 = queryWeight, product of:
                1.0853269 = boost
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.018293075 = queryNorm
              0.7388133 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.109375 = fieldNorm(doc=1460)
          0.10210323 = weight(abstract_txt:ausgehend in 1460) [ClassicSimilarity], result of:
            0.10210323 = score(doc=1460,freq=1.0), product of:
              0.1368226 = queryWeight, product of:
                1.0962447 = boost
                6.822815 = idf(docFreq=127, maxDocs=43254)
                0.018293075 = queryNorm
              0.7462454 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.822815 = idf(docFreq=127, maxDocs=43254)
                0.109375 = fieldNorm(doc=1460)
          0.15528427 = weight(abstract_txt:detailliert in 1460) [ClassicSimilarity], result of:
            0.15528427 = score(doc=1460,freq=1.0), product of:
              0.18094635 = queryWeight, product of:
                1.260676 = boost
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.018293075 = queryNorm
              0.85817856 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.109375 = fieldNorm(doc=1460)
          0.2317717 = weight(abstract_txt:schluß in 1460) [ClassicSimilarity], result of:
            0.2317717 = score(doc=1460,freq=1.0), product of:
              0.23632252 = queryWeight, product of:
                1.4407252 = boost
                8.966795 = idf(docFreq=14, maxDocs=43254)
                0.018293075 = queryNorm
              0.98074317 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.966795 = idf(docFreq=14, maxDocs=43254)
                0.109375 = fieldNorm(doc=1460)
          0.090957925 = weight(abstract_txt:wird in 1460) [ClassicSimilarity], result of:
            0.090957925 = score(doc=1460,freq=3.0), product of:
              0.12667528 = queryWeight, product of:
                1.8269858 = boost
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.018293075 = queryNorm
              0.71804005 = fieldWeight in 1460, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.109375 = fieldNorm(doc=1460)
        0.24 = coord(6/25)
    
  3. Fuhr, N.: Theorie des Information Retrieval I : Modelle (2004) 0.15
    0.14621846 = sum of:
      0.14621846 = product of:
        0.60924363 = sum of:
          0.056618787 = weight(abstract_txt:indexierung in 4913) [ClassicSimilarity], result of:
            0.056618787 = score(doc=4913,freq=1.0), product of:
              0.13411085 = queryWeight, product of:
                1.0853269 = boost
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.018293075 = queryNorm
              0.422179 = fieldWeight in 4913, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.0625 = fieldNorm(doc=4913)
          0.007877679 = weight(abstract_txt:information in 4913) [ClassicSimilarity], result of:
            0.007877679 = score(doc=4913,freq=1.0), product of:
              0.05193532 = queryWeight, product of:
                1.1698242 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.018293075 = queryNorm
              0.1516825 = fieldWeight in 4913, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.0625 = fieldNorm(doc=4913)
          0.032561 = weight(abstract_txt:retrieval in 4913) [ClassicSimilarity], result of:
            0.032561 = score(doc=4913,freq=2.0), product of:
              0.10616608 = queryWeight, product of:
                1.6725615 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.018293075 = queryNorm
              0.3066987 = fieldWeight in 4913, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0625 = fieldNorm(doc=4913)
          0.051975954 = weight(abstract_txt:wird in 4913) [ClassicSimilarity], result of:
            0.051975954 = score(doc=4913,freq=3.0), product of:
              0.12667528 = queryWeight, product of:
                1.8269858 = boost
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.018293075 = queryNorm
              0.4103086 = fieldWeight in 4913, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.0625 = fieldNorm(doc=4913)
          0.17198049 = weight(abstract_txt:dokumenten in 4913) [ClassicSimilarity], result of:
            0.17198049 = score(doc=4913,freq=3.0), product of:
              0.24572305 = queryWeight, product of:
                2.0776222 = boost
                6.4653587 = idf(docFreq=182, maxDocs=43254)
                0.018293075 = queryNorm
              0.6998956 = fieldWeight in 4913, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4653587 = idf(docFreq=182, maxDocs=43254)
                0.0625 = fieldNorm(doc=4913)
          0.28822973 = weight(abstract_txt:modell in 4913) [ClassicSimilarity], result of:
            0.28822973 = score(doc=4913,freq=2.0), product of:
              0.50002617 = queryWeight, product of:
                4.19136 = boost
                6.5215535 = idf(docFreq=172, maxDocs=43254)
                0.018293075 = queryNorm
              0.5764293 = fieldWeight in 4913, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5215535 = idf(docFreq=172, maxDocs=43254)
                0.0625 = fieldNorm(doc=4913)
        0.24 = coord(6/25)
    
  4. Siebenlist, T.: MEMOSE. Spezialsuchmaschine für emotional geladene Dokumente (2012) 0.13
    0.12650649 = sum of:
      0.12650649 = product of:
        0.5271104 = sum of:
          0.07854099 = weight(abstract_txt:kurz in 1640) [ClassicSimilarity], result of:
            0.07854099 = score(doc=1640,freq=1.0), product of:
              0.12729958 = queryWeight, product of:
                1.0574068 = boost
                6.5810947 = idf(docFreq=162, maxDocs=43254)
                0.018293075 = queryNorm
              0.61697763 = fieldWeight in 1640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5810947 = idf(docFreq=162, maxDocs=43254)
                0.09375 = fieldNorm(doc=1640)
          0.084928185 = weight(abstract_txt:indexierung in 1640) [ClassicSimilarity], result of:
            0.084928185 = score(doc=1640,freq=1.0), product of:
              0.13411085 = queryWeight, product of:
                1.0853269 = boost
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.018293075 = queryNorm
              0.63326854 = fieldWeight in 1640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.09375 = fieldNorm(doc=1640)
          0.011816518 = weight(abstract_txt:information in 1640) [ClassicSimilarity], result of:
            0.011816518 = score(doc=1640,freq=1.0), product of:
              0.05193532 = queryWeight, product of:
                1.1698242 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.018293075 = queryNorm
              0.22752374 = fieldWeight in 1640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.09375 = fieldNorm(doc=1640)
          0.048841503 = weight(abstract_txt:retrieval in 1640) [ClassicSimilarity], result of:
            0.048841503 = score(doc=1640,freq=2.0), product of:
              0.10616608 = queryWeight, product of:
                1.6725615 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.018293075 = queryNorm
              0.46004808 = fieldWeight in 1640, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.09375 = fieldNorm(doc=1640)
          0.0450125 = weight(abstract_txt:wird in 1640) [ClassicSimilarity], result of:
            0.0450125 = score(doc=1640,freq=1.0), product of:
              0.12667528 = queryWeight, product of:
                1.8269858 = boost
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.018293075 = queryNorm
              0.35533768 = fieldWeight in 1640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.09375 = fieldNorm(doc=1640)
          0.25797072 = weight(abstract_txt:dokumenten in 1640) [ClassicSimilarity], result of:
            0.25797072 = score(doc=1640,freq=3.0), product of:
              0.24572305 = queryWeight, product of:
                2.0776222 = boost
                6.4653587 = idf(docFreq=182, maxDocs=43254)
                0.018293075 = queryNorm
              1.0498434 = fieldWeight in 1640, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4653587 = idf(docFreq=182, maxDocs=43254)
                0.09375 = fieldNorm(doc=1640)
        0.24 = coord(6/25)
    
  5. Markscheffel, B.: ¬Eine Entwurfsmethodik für Hypermedia-Systeme auf Basis des Spatial-Satellite-Modells S**2M (1993) 0.12
    0.1228141 = sum of:
      0.1228141 = product of:
        0.76758814 = sum of:
          0.1237678 = weight(abstract_txt:ausgehend in 3709) [ClassicSimilarity], result of:
            0.1237678 = score(doc=3709,freq=2.0), product of:
              0.1368226 = queryWeight, product of:
                1.0962447 = boost
                6.822815 = idf(docFreq=127, maxDocs=43254)
                0.018293075 = queryNorm
              0.90458596 = fieldWeight in 3709, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.822815 = idf(docFreq=127, maxDocs=43254)
                0.09375 = fieldNorm(doc=3709)
          0.063657284 = weight(abstract_txt:wird in 3709) [ClassicSimilarity], result of:
            0.063657284 = score(doc=3709,freq=2.0), product of:
              0.12667528 = queryWeight, product of:
                1.8269858 = boost
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.018293075 = queryNorm
              0.50252336 = fieldWeight in 3709, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.09375 = fieldNorm(doc=3709)
          0.14781852 = weight(abstract_txt:systemen in 3709) [ClassicSimilarity], result of:
            0.14781852 = score(doc=3709,freq=1.0), product of:
              0.2444886 = queryWeight, product of:
                2.0723968 = boost
                6.449098 = idf(docFreq=185, maxDocs=43254)
                0.018293075 = queryNorm
              0.60460293 = fieldWeight in 3709, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.449098 = idf(docFreq=185, maxDocs=43254)
                0.09375 = fieldNorm(doc=3709)
          0.4323446 = weight(abstract_txt:modell in 3709) [ClassicSimilarity], result of:
            0.4323446 = score(doc=3709,freq=2.0), product of:
              0.50002617 = queryWeight, product of:
                4.19136 = boost
                6.5215535 = idf(docFreq=172, maxDocs=43254)
                0.018293075 = queryNorm
              0.86464393 = fieldWeight in 3709, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5215535 = idf(docFreq=172, maxDocs=43254)
                0.09375 = fieldNorm(doc=3709)
        0.16 = coord(4/25)