Document (#18733)

Baumgarten, C.
Probabilistische Modellierung der effizienten Informationssuche in verteilten multimedialen Dokumentbeständen durch Einschränkung des Suchraums
Hypertext - Information Retrieval - Multimedia '97: Theorien, Modelle und Implementierungen integrierter elektronischer Informationssysteme. Proceedings HIM '97. Ed.: N. Fuhr et al.
Konstanz : Universitätsverlag
Schriften zur Informationswissenschaft; vol. 30
A model for information retrieval in a distributed multimedia document collection is presented. The model is based on the probability ranking principle. After an individual ranked list has been computed for each subcollection, these lists are merged step by step into a final ranked list in which the documents are ordered by their probability of relevance. Documents (or document passages, in the case of multimedia documents) from different subcollections can be indexed with different methods. Different probabilistic methods can also be used to compute the subcollection-specific ranked lists. This supports the integration of documents of any type. Moreover, the underlying data volume can be scaled arbitrarily. To enable efficient retrieval, the model is extended by a criterion for restricting the search space, which takes various cost factors into account.
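
The merging step described in the abstract can be illustrated with a minimal sketch. It is not the author's method: it assumes that each subcollection already delivers a list of (document id, relevance probability) pairs sorted in descending order, and that the probabilities are directly comparable across subcollections. The function name `merge_ranked_lists` and the sample data are hypothetical, and the search-space restriction criterion and its cost factors are not modelled.

```python
import heapq


def merge_ranked_lists(ranked_lists):
    """Merge per-subcollection rankings into one final ranking.

    Assumption (not from the source): every input list holds
    (doc_id, relevance_probability) pairs, already sorted by
    descending probability, and the probabilities are comparable
    across subcollections.
    """
    # heapq.merge consumes the sorted inputs lazily and yields
    # the globally highest remaining probability at each step.
    return list(
        heapq.merge(*ranked_lists, key=lambda pair: pair[1], reverse=True)
    )


# Hypothetical rankings from two subcollections.
text_ranking = [("text-1", 0.9), ("text-2", 0.4)]
image_ranking = [("image-1", 0.7), ("image-2", 0.1)]

final_ranking = merge_ranked_lists([text_ranking, image_ranking])
print(final_ranking)
```

Because the inputs are consumed lazily, a caller could stop reading the merged stream after the first few documents, which is one simple way to avoid touching the tail of every subcollection.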

Similar documents (content)

  1. Panyr, J.: Probabilistische Modelle in Information-Retrieval-Systemen (1986) 0.16
    0.16167206 = sum of:
      0.16167206 = product of:
        0.8083603 = sum of:
          0.041530173 = weight(abstract_txt:dabei in 1460) [ClassicSimilarity], result of:
            0.041530173 = score(doc=1460,freq=1.0), product of:
              0.081003994 = queryWeight, product of:
                1.1255001 = boost
                4.687478 = idf(docFreq=1106, maxDocs=44218)
                0.015354005 = queryNorm
              0.5126929 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.687478 = idf(docFreq=1106, maxDocs=44218)
                0.109375 = fieldNorm(doc=1460)
          0.31063896 = weight(abstract_txt:probabilistischen in 1460) [ClassicSimilarity], result of:
            0.31063896 = score(doc=1460,freq=3.0), product of:
              0.17049728 = queryWeight, product of:
                1.154612 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.015354005 = queryNorm
              1.8219584 = fieldWeight in 1460, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.109375 = fieldNorm(doc=1460)
          0.026071617 = weight(abstract_txt:werden in 1460) [ClassicSimilarity], result of:
            0.026071617 = score(doc=1460,freq=1.0), product of:
              0.06798394 = queryWeight, product of:
                1.2628189 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.015354005 = queryNorm
              0.3834967 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.109375 = fieldNorm(doc=1460)
          0.0562755 = weight(abstract_txt:wird in 1460) [ClassicSimilarity], result of:
            0.0562755 = score(doc=1460,freq=3.0), product of:
              0.07872878 = queryWeight, product of:
                1.3589538 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.015354005 = queryNorm
              0.71480215 = fieldWeight in 1460, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.109375 = fieldNorm(doc=1460)
          0.37384403 = weight(abstract_txt:probabilistische in 1460) [ClassicSimilarity], result of:
            0.37384403 = score(doc=1460,freq=1.0), product of:
              0.35052922 = queryWeight, product of:
                2.3412857 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.015354005 = queryNorm
              1.0665132 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.109375 = fieldNorm(doc=1460)
        0.2 = coord(5/25)
  2. Seelbach, H.E.: Von der Stichwortliste zum halbautomatisch kontrollierten Wortschatz (1977) 0.11
    0.10649576 = sum of:
      0.10649576 = product of:
        0.5324788 = sum of:
          0.16376843 = weight(abstract_txt:überführt in 8950) [ClassicSimilarity], result of:
            0.16376843 = score(doc=8950,freq=1.0), product of:
              0.14680678 = queryWeight, product of:
                1.0713968 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.015354005 = queryNorm
              1.1155373 = fieldWeight in 8950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.125 = fieldNorm(doc=8950)
          0.21362516 = weight(abstract_txt:dokumentbeständen in 8950) [ClassicSimilarity], result of:
            0.21362516 = score(doc=8950,freq=1.0), product of:
              0.17526461 = queryWeight, product of:
                1.1706429 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.015354005 = queryNorm
              1.2188722 = fieldWeight in 8950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.125 = fieldNorm(doc=8950)
          0.029796135 = weight(abstract_txt:werden in 8950) [ClassicSimilarity], result of:
            0.029796135 = score(doc=8950,freq=1.0), product of:
              0.06798394 = queryWeight, product of:
                1.2628189 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.015354005 = queryNorm
              0.43828195 = fieldWeight in 8950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.125 = fieldNorm(doc=8950)
          0.037132204 = weight(abstract_txt:wird in 8950) [ClassicSimilarity], result of:
            0.037132204 = score(doc=8950,freq=1.0), product of:
              0.07872878 = queryWeight, product of:
                1.3589538 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.015354005 = queryNorm
              0.4716471 = fieldWeight in 8950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.125 = fieldNorm(doc=8950)
          0.088156864 = weight(abstract_txt:verfahren in 8950) [ClassicSimilarity], result of:
            0.088156864 = score(doc=8950,freq=1.0), product of:
              0.12239774 = queryWeight, product of:
                1.3834995 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.015354005 = queryNorm
              0.7202491 = fieldWeight in 8950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.125 = fieldNorm(doc=8950)
        0.2 = coord(5/25)
  3. Thiel, M.: Bedingt wahrscheinliche Syntaxbäume (2006) 0.10
    0.10422546 = sum of:
      0.10422546 = product of:
        0.37223378 = sum of:
          0.0132414475 = weight(abstract_txt:durch in 6069) [ClassicSimilarity], result of:
            0.0132414475 = score(doc=6069,freq=1.0), product of:
              0.06650773 = queryWeight, product of:
                1.0198313 = boost
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.015354005 = queryNorm
              0.19909638 = fieldWeight in 6069, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.046875 = fieldNorm(doc=6069)
          0.07686321 = weight(abstract_txt:probabilistischen in 6069) [ClassicSimilarity], result of:
            0.07686321 = score(doc=6069,freq=1.0), product of:
              0.17049728 = queryWeight, product of:
                1.154612 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.015354005 = queryNorm
              0.45081776 = fieldWeight in 6069, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.046875 = fieldNorm(doc=6069)
          0.02498482 = weight(abstract_txt:werden in 6069) [ClassicSimilarity], result of:
            0.02498482 = score(doc=6069,freq=5.0), product of:
              0.06798394 = queryWeight, product of:
                1.2628189 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.015354005 = queryNorm
              0.36751062 = fieldWeight in 6069, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.046875 = fieldNorm(doc=6069)
          0.029758494 = weight(abstract_txt:verschiedenen in 6069) [ClassicSimilarity], result of:
            0.029758494 = score(doc=6069,freq=1.0), product of:
              0.114109665 = queryWeight, product of:
                1.3358371 = boost
                5.563489 = idf(docFreq=460, maxDocs=44218)
                0.015354005 = queryNorm
              0.26078856 = fieldWeight in 6069, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.563489 = idf(docFreq=460, maxDocs=44218)
                0.046875 = fieldNorm(doc=6069)
          0.034108106 = weight(abstract_txt:wird in 6069) [ClassicSimilarity], result of:
            0.034108106 = score(doc=6069,freq=6.0), product of:
              0.07872878 = queryWeight, product of:
                1.3589538 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.015354005 = queryNorm
              0.43323553 = fieldWeight in 6069, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.046875 = fieldNorm(doc=6069)
          0.033058822 = weight(abstract_txt:verfahren in 6069) [ClassicSimilarity], result of:
            0.033058822 = score(doc=6069,freq=1.0), product of:
              0.12239774 = queryWeight, product of:
                1.3834995 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.015354005 = queryNorm
              0.2700934 = fieldWeight in 6069, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.046875 = fieldNorm(doc=6069)
          0.16021888 = weight(abstract_txt:probabilistische in 6069) [ClassicSimilarity], result of:
            0.16021888 = score(doc=6069,freq=1.0), product of:
              0.35052922 = queryWeight, product of:
                2.3412857 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.015354005 = queryNorm
              0.4570771 = fieldWeight in 6069, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.046875 = fieldNorm(doc=6069)
        0.28 = coord(7/25)
  4. Enderle, W.: Auf dem Weg zur digitalen Bibliothek : Projekte in Deutschland (1997) 0.10
    0.10411803 = sum of:
      0.10411803 = product of:
        0.43382514 = sum of:
          0.022069078 = weight(abstract_txt:durch in 1650) [ClassicSimilarity], result of:
            0.022069078 = score(doc=1650,freq=1.0), product of:
              0.06650773 = queryWeight, product of:
                1.0198313 = boost
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.015354005 = queryNorm
              0.33182728 = fieldWeight in 1650, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.078125 = fieldNorm(doc=1650)
          0.029664408 = weight(abstract_txt:dabei in 1650) [ClassicSimilarity], result of:
            0.029664408 = score(doc=1650,freq=1.0), product of:
              0.081003994 = queryWeight, product of:
                1.1255001 = boost
                4.687478 = idf(docFreq=1106, maxDocs=44218)
                0.015354005 = queryNorm
              0.3662092 = fieldWeight in 1650, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.687478 = idf(docFreq=1106, maxDocs=44218)
                0.078125 = fieldNorm(doc=1650)
          0.032255262 = weight(abstract_txt:werden in 1650) [ClassicSimilarity], result of:
            0.032255262 = score(doc=1650,freq=3.0), product of:
              0.06798394 = queryWeight, product of:
                1.2628189 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.015354005 = queryNorm
              0.47445413 = fieldWeight in 1650, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.078125 = fieldNorm(doc=1650)
          0.040196788 = weight(abstract_txt:wird in 1650) [ClassicSimilarity], result of:
            0.040196788 = score(doc=1650,freq=3.0), product of:
              0.07872878 = queryWeight, product of:
                1.3589538 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.015354005 = queryNorm
              0.51057297 = fieldWeight in 1650, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.078125 = fieldNorm(doc=1650)
          0.12655947 = weight(abstract_txt:verteilten in 1650) [ClassicSimilarity], result of:
            0.12655947 = score(doc=1650,freq=1.0), product of:
              0.2130815 = queryWeight, product of:
                1.8254299 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.015354005 = queryNorm
              0.59394866 = fieldWeight in 1650, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.078125 = fieldNorm(doc=1650)
          0.18308015 = weight(abstract_txt:dokumente in 1650) [ClassicSimilarity], result of:
            0.18308015 = score(doc=1650,freq=3.0), product of:
              0.21632174 = queryWeight, product of:
                2.2526205 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.015354005 = queryNorm
              0.84633267 = fieldWeight in 1650, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.078125 = fieldNorm(doc=1650)
        0.24 = coord(6/25)
  5. Roth, A.: Modellierung und Anwendung von Ontologien am Beispiel "Operations Research & Management Science" (2002) 0.10
    0.095954984 = sum of:
      0.095954984 = product of:
        0.34269637 = sum of:
          0.024968311 = weight(abstract_txt:durch in 5011) [ClassicSimilarity], result of:
            0.024968311 = score(doc=5011,freq=2.0), product of:
              0.06650773 = queryWeight, product of:
                1.0198313 = boost
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.015354005 = queryNorm
              0.3754197 = fieldWeight in 5011, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
          0.023731528 = weight(abstract_txt:dabei in 5011) [ClassicSimilarity], result of:
            0.023731528 = score(doc=5011,freq=1.0), product of:
              0.081003994 = queryWeight, product of:
                1.1255001 = boost
                4.687478 = idf(docFreq=1106, maxDocs=44218)
                0.015354005 = queryNorm
              0.29296738 = fieldWeight in 5011, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.687478 = idf(docFreq=1106, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
          0.039416578 = weight(abstract_txt:werden in 5011) [ClassicSimilarity], result of:
            0.039416578 = score(doc=5011,freq=7.0), product of:
              0.06798394 = queryWeight, product of:
                1.2628189 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.015354005 = queryNorm
              0.5797925 = fieldWeight in 5011, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
          0.039677992 = weight(abstract_txt:verschiedenen in 5011) [ClassicSimilarity], result of:
            0.039677992 = score(doc=5011,freq=1.0), product of:
              0.114109665 = queryWeight, product of:
                1.3358371 = boost
                5.563489 = idf(docFreq=460, maxDocs=44218)
                0.015354005 = queryNorm
              0.34771806 = fieldWeight in 5011, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.563489 = idf(docFreq=460, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
          0.018566102 = weight(abstract_txt:wird in 5011) [ClassicSimilarity], result of:
            0.018566102 = score(doc=5011,freq=1.0), product of:
              0.07872878 = queryWeight, product of:
                1.3589538 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.015354005 = queryNorm
              0.23582356 = fieldWeight in 5011, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
          0.10124757 = weight(abstract_txt:verteilten in 5011) [ClassicSimilarity], result of:
            0.10124757 = score(doc=5011,freq=1.0), product of:
              0.2130815 = queryWeight, product of:
                1.8254299 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.015354005 = queryNorm
              0.47515893 = fieldWeight in 5011, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
          0.09508826 = weight(abstract_txt:modell in 5011) [ClassicSimilarity], result of:
            0.09508826 = score(doc=5011,freq=1.0), product of:
              0.23392195 = queryWeight, product of:
                2.3424666 = boost
                6.5039306 = idf(docFreq=179, maxDocs=44218)
                0.015354005 = queryNorm
              0.40649566 = fieldWeight in 5011, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5039306 = idf(docFreq=179, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
        0.28 = coord(7/25)
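
The score trees above follow the arithmetic of Lucene's ClassicSimilarity: each term contributes queryWeight (boost × idf × queryNorm) times fieldWeight (tf × idf × fieldNorm), with tf = sqrt(termFreq) and idf = 1 + ln(maxDocs / (docFreq + 1)), and the sum is multiplied by the coord factor. The German terms are tokens from the abstract field of the source record. The following sketch recomputes the score of the first hit (doc 1460) from the values printed in its tree. The function names are illustrative, not Lucene API, and because Lucene works with 32-bit floats the result should only be expected to agree with the listed 0.16167206 up to small rounding differences.

```python
import math

MAX_DOCS = 44218          # maxDocs from the explain output
QUERY_NORM = 0.015354005  # queryNorm from the explain output


def idf(doc_freq, max_docs=MAX_DOCS):
    """ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def term_score(freq, doc_freq, boost, field_norm, query_norm=QUERY_NORM):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf = sqrt(freq)
    return query_weight * field_weight


def document_score(terms, field_norm, matched, total):
    """Sum of the term scores, scaled by coord(matched/total)."""
    raw = sum(term_score(f, df, b, field_norm) for f, df, b in terms)
    return raw * (matched / total)


# (termFreq, docFreq, boost) for the five terms matched in doc 1460.
DOC_1460_TERMS = [
    (1.0, 1106, 1.1255001),  # dabei
    (3.0, 7, 1.154612),      # probabilistischen
    (1.0, 3606, 1.2628189),  # werden
    (3.0, 2761, 1.3589538),  # wird
    (1.0, 6, 2.3412857),     # probabilistische
]

score_1460 = document_score(DOC_1460_TERMS, field_norm=0.109375, matched=5, total=25)
print(score_1460)
```

The same functions can be applied to the other hits by substituting their termFreq, docFreq, boost, fieldNorm and coord values.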