Document (#34818)

Author
Puzicha, J.
Title
Informationen finden! : Intelligente Suchmaschinentechnologie & automatische Kategorisierung
Imprint
Rheinbach : recommind
Year
2007
Pages
15 S
Abstract
Wie in diesem Text erläutert wurde, ist die Effektivität von Such- und Klassifizierungssystemen durch folgendes bestimmt: 1) den Arbeitsauftrag, 2) die Genauigkeit des Systems, 3) den zu erreichenden Automatisierungsgrad, 4) die Einfachheit der Integration in bereits vorhandene Systeme. Diese Kriterien gehen davon aus, dass jedes System, unabhängig von der Technologie, in der Lage ist, Grundvoraussetzungen des Produkts in Bezug auf Funktionalität, Skalierbarkeit und Input-Methode zu erfüllen. Diese Produkteigenschaften sind in der Recommind Produktliteratur genauer erläutert. Von diesen Fähigkeiten ausgehend sollte die vorhergehende Diskussion jedoch einige klare Trends aufgezeigt haben. Es ist nicht überraschend, dass jüngere Entwicklungen im Maschine Learning und anderen Bereichen der Informatik einen theoretischen Ausgangspunkt für die Entwicklung von Suchmaschinen- und Klassifizierungstechnologie haben. Besonders jüngste Fortschritte bei den statistischen Methoden (PLSA) und anderen mathematischen Werkzeugen (SVMs) haben eine Ergebnisqualität auf Durchbruchsniveau erreicht. Dazu kommt noch die Flexibilität in der Anwendung durch Selbsttraining und Kategorienerkennen von PLSA-Systemen, wie auch eine neue Generation von vorher unerreichten Produktivitätsverbesserungen.
Content
Technical Whitepaper - Grundlagen der Informationsgewinnung
Footnote
Vgl. auch: http://www.recommind.de/?id=mindserver_categorization.
Theme
Automatisches Klassifizieren
Object
Latent Semantic Indexing

Similar documents (content)

  1. Dueck, G.: Wild duck : Empirische Philosophie der Mensch-Computer-Vernetzung (2004) 0.10
    0.10070875 = sum of:
      0.10070875 = product of:
        0.35967413 = sum of:
          0.06132691 = weight(abstract_txt:mathematischen in 653) [ClassicSimilarity], result of:
            0.06132691 = score(doc=653,freq=1.0), product of:
              0.16805783 = queryWeight, product of:
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.021587765 = queryNorm
              0.36491552 = fieldWeight in 653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.046875 = fieldNorm(doc=653)
          0.019920219 = weight(abstract_txt:durch in 653) [ClassicSimilarity], result of:
            0.019920219 = score(doc=653,freq=1.0), product of:
              0.10005315 = queryWeight, product of:
                1.0911916 = boost
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.021587765 = queryNorm
              0.19909638 = fieldWeight in 653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.046875 = fieldNorm(doc=653)
          0.03669523 = weight(abstract_txt:diese in 653) [ClassicSimilarity], result of:
            0.03669523 = score(doc=653,freq=3.0), product of:
              0.10424791 = queryWeight, product of:
                1.113831 = boost
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.021587765 = queryNorm
              0.35199967 = fieldWeight in 653, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.046875 = fieldNorm(doc=653)
          0.041620433 = weight(abstract_txt:dass in 653) [ClassicSimilarity], result of:
            0.041620433 = score(doc=653,freq=3.0), product of:
              0.113378845 = queryWeight, product of:
                1.1615868 = boost
                4.5213976 = idf(docFreq=1306, maxDocs=44218)
                0.021587765 = queryNorm
              0.36709172 = fieldWeight in 653, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5213976 = idf(docFreq=1306, maxDocs=44218)
                0.046875 = fieldNorm(doc=653)
          0.0965985 = weight(abstract_txt:produkts in 653) [ClassicSimilarity], result of:
            0.0965985 = score(doc=653,freq=1.0), product of:
              0.22751233 = queryWeight, product of:
                1.163518 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.021587765 = queryNorm
              0.42458576 = fieldWeight in 653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.046875 = fieldNorm(doc=653)
          0.063987486 = weight(abstract_txt:anderen in 653) [ClassicSimilarity], result of:
            0.063987486 = score(doc=653,freq=3.0), product of:
              0.15102805 = queryWeight, product of:
                1.340647 = boost
                5.2183776 = idf(docFreq=650, maxDocs=44218)
                0.021587765 = queryNorm
              0.42367947 = fieldWeight in 653, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2183776 = idf(docFreq=650, maxDocs=44218)
                0.046875 = fieldNorm(doc=653)
          0.039525334 = weight(abstract_txt:haben in 653) [ClassicSimilarity], result of:
            0.039525334 = score(doc=653,freq=1.0), product of:
              0.18084873 = queryWeight, product of:
                1.7967556 = boost
                4.6624994 = idf(docFreq=1134, maxDocs=44218)
                0.021587765 = queryNorm
              0.21855466 = fieldWeight in 653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6624994 = idf(docFreq=1134, maxDocs=44218)
                0.046875 = fieldNorm(doc=653)
        0.28 = coord(7/25)
    
  2. Burblies, C.; Wolff, J.E.: Vascoda - Effiziente Vermittlung wissenschaftlicher information (2009) 0.09
    0.0891055 = sum of:
      0.0891055 = product of:
        0.4455275 = sum of:
          0.032866683 = weight(abstract_txt:durch in 2783) [ClassicSimilarity], result of:
            0.032866683 = score(doc=2783,freq=2.0), product of:
              0.10005315 = queryWeight, product of:
                1.0911916 = boost
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.021587765 = queryNorm
              0.32849225 = fieldWeight in 2783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2783)
          0.13669108 = weight(abstract_txt:suchmaschinentechnologie in 2783) [ClassicSimilarity], result of:
            0.13669108 = score(doc=2783,freq=2.0), product of:
              0.20537312 = queryWeight, product of:
                1.1054585 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.021587765 = queryNorm
              0.6655743 = fieldWeight in 2783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2783)
          0.16765586 = weight(abstract_txt:einfachheit in 2783) [ClassicSimilarity], result of:
            0.16765586 = score(doc=2783,freq=2.0), product of:
              0.23532207 = queryWeight, product of:
                1.1833193 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.021587765 = queryNorm
              0.71245277 = fieldWeight in 2783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2783)
          0.04310039 = weight(abstract_txt:anderen in 2783) [ClassicSimilarity], result of:
            0.04310039 = score(doc=2783,freq=1.0), product of:
              0.15102805 = queryWeight, product of:
                1.340647 = boost
                5.2183776 = idf(docFreq=650, maxDocs=44218)
                0.021587765 = queryNorm
              0.28538004 = fieldWeight in 2783, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2183776 = idf(docFreq=650, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2783)
          0.06521347 = weight(abstract_txt:haben in 2783) [ClassicSimilarity], result of:
            0.06521347 = score(doc=2783,freq=2.0), product of:
              0.18084873 = queryWeight, product of:
                1.7967556 = boost
                4.6624994 = idf(docFreq=1134, maxDocs=44218)
                0.021587765 = queryNorm
              0.3605968 = fieldWeight in 2783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6624994 = idf(docFreq=1134, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2783)
        0.2 = coord(5/25)
    
  3. Raicher, E.: Möglichkeiten und Grenzen von Primo bei der Einführung in deutschsprachigen Bibliotheken und Bibliotheksverbünden (2010) 0.08
    0.08483671 = sum of:
      0.08483671 = product of:
        0.30298823 = sum of:
          0.037561923 = weight(abstract_txt:durch in 4311) [ClassicSimilarity], result of:
            0.037561923 = score(doc=4311,freq=8.0), product of:
              0.10005315 = queryWeight, product of:
                1.0911916 = boost
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.021587765 = queryNorm
              0.3754197 = fieldWeight in 4311, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.03125 = fieldNorm(doc=4311)
          0.055231538 = weight(abstract_txt:suchmaschinentechnologie in 4311) [ClassicSimilarity], result of:
            0.055231538 = score(doc=4311,freq=1.0), product of:
              0.20537312 = queryWeight, product of:
                1.1054585 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.021587765 = queryNorm
              0.26893264 = fieldWeight in 4311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.03125 = fieldNorm(doc=4311)
          0.019974355 = weight(abstract_txt:diese in 4311) [ClassicSimilarity], result of:
            0.019974355 = score(doc=4311,freq=2.0), product of:
              0.10424791 = queryWeight, product of:
                1.113831 = boost
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.021587765 = queryNorm
              0.19160436 = fieldWeight in 4311, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.03125 = fieldNorm(doc=4311)
          0.057087015 = weight(abstract_txt:überraschend in 4311) [ClassicSimilarity], result of:
            0.057087015 = score(doc=4311,freq=1.0), product of:
              0.20994736 = queryWeight, product of:
                1.1177015 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.021587765 = queryNorm
              0.27191108 = fieldWeight in 4311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.03125 = fieldNorm(doc=4311)
          0.042384177 = weight(abstract_txt:dass in 4311) [ClassicSimilarity], result of:
            0.042384177 = score(doc=4311,freq=7.0), product of:
              0.113378845 = queryWeight, product of:
                1.1615868 = boost
                4.5213976 = idf(docFreq=1306, maxDocs=44218)
                0.021587765 = queryNorm
              0.3738279 = fieldWeight in 4311, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.5213976 = idf(docFreq=1306, maxDocs=44218)
                0.03125 = fieldNorm(doc=4311)
          0.064399 = weight(abstract_txt:produkts in 4311) [ClassicSimilarity], result of:
            0.064399 = score(doc=4311,freq=1.0), product of:
              0.22751233 = queryWeight, product of:
                1.163518 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.021587765 = queryNorm
              0.28305718 = fieldWeight in 4311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.03125 = fieldNorm(doc=4311)
          0.026350223 = weight(abstract_txt:haben in 4311) [ClassicSimilarity], result of:
            0.026350223 = score(doc=4311,freq=1.0), product of:
              0.18084873 = queryWeight, product of:
                1.7967556 = boost
                4.6624994 = idf(docFreq=1134, maxDocs=44218)
                0.021587765 = queryNorm
              0.1457031 = fieldWeight in 4311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6624994 = idf(docFreq=1134, maxDocs=44218)
                0.03125 = fieldNorm(doc=4311)
        0.28 = coord(7/25)
    
  4. Weishaupt, K.: Alephino : ein neues Bibliothekssystem für kleine und mittlere Bibliotheken (2004) 0.08
    0.08426965 = sum of:
      0.08426965 = product of:
        0.35112354 = sum of:
          0.07387176 = weight(abstract_txt:funktionalität in 2286) [ClassicSimilarity], result of:
            0.07387176 = score(doc=2286,freq=1.0), product of:
              0.17167714 = queryWeight, product of:
                1.0107107 = boost
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.021587765 = queryNorm
              0.4302947 = fieldWeight in 2286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2286)
          0.089917764 = weight(abstract_txt:vorher in 2286) [ClassicSimilarity], result of:
            0.089917764 = score(doc=2286,freq=1.0), product of:
              0.19571488 = queryWeight, product of:
                1.0791519 = boost
                8.401051 = idf(docFreq=26, maxDocs=44218)
                0.021587765 = queryNorm
              0.45943245 = fieldWeight in 2286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.401051 = idf(docFreq=26, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2286)
          0.024717001 = weight(abstract_txt:diese in 2286) [ClassicSimilarity], result of:
            0.024717001 = score(doc=2286,freq=1.0), product of:
              0.10424791 = queryWeight, product of:
                1.113831 = boost
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.021587765 = queryNorm
              0.23709829 = fieldWeight in 2286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2286)
          0.039646767 = weight(abstract_txt:dass in 2286) [ClassicSimilarity], result of:
            0.039646767 = score(doc=2286,freq=2.0), product of:
              0.113378845 = queryWeight, product of:
                1.1615868 = boost
                4.5213976 = idf(docFreq=1306, maxDocs=44218)
                0.021587765 = queryNorm
              0.349684 = fieldWeight in 2286, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5213976 = idf(docFreq=1306, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2286)
          0.04310039 = weight(abstract_txt:anderen in 2286) [ClassicSimilarity], result of:
            0.04310039 = score(doc=2286,freq=1.0), product of:
              0.15102805 = queryWeight, product of:
                1.340647 = boost
                5.2183776 = idf(docFreq=650, maxDocs=44218)
                0.021587765 = queryNorm
              0.28538004 = fieldWeight in 2286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2183776 = idf(docFreq=650, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2286)
          0.07986987 = weight(abstract_txt:haben in 2286) [ClassicSimilarity], result of:
            0.07986987 = score(doc=2286,freq=3.0), product of:
              0.18084873 = queryWeight, product of:
                1.7967556 = boost
                4.6624994 = idf(docFreq=1134, maxDocs=44218)
                0.021587765 = queryNorm
              0.44163907 = fieldWeight in 2286, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6624994 = idf(docFreq=1134, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2286)
        0.24 = coord(6/25)
    
  5. Donsbach, W.: Wahrheit in den Medien : über den Sinn eines methodischen Objektivitätsbegriffes (2001) 0.08
    0.07527337 = sum of:
      0.07527337 = product of:
        0.31363904 = sum of:
          0.10415431 = weight(abstract_txt:klare in 5895) [ClassicSimilarity], result of:
            0.10415431 = score(doc=5895,freq=1.0), product of:
              0.19747724 = queryWeight, product of:
                1.0839996 = boost
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.021587765 = queryNorm
              0.5274244 = fieldWeight in 5895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.0625 = fieldNorm(doc=5895)
          0.026560292 = weight(abstract_txt:durch in 5895) [ClassicSimilarity], result of:
            0.026560292 = score(doc=5895,freq=1.0), product of:
              0.10005315 = queryWeight, product of:
                1.0911916 = boost
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.021587765 = queryNorm
              0.26546183 = fieldWeight in 5895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.0625 = fieldNorm(doc=5895)
          0.048926976 = weight(abstract_txt:diese in 5895) [ClassicSimilarity], result of:
            0.048926976 = score(doc=5895,freq=3.0), product of:
              0.10424791 = queryWeight, product of:
                1.113831 = boost
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.021587765 = queryNorm
              0.4693329 = fieldWeight in 5895, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.0625 = fieldNorm(doc=5895)
          0.032039426 = weight(abstract_txt:dass in 5895) [ClassicSimilarity], result of:
            0.032039426 = score(doc=5895,freq=1.0), product of:
              0.113378845 = queryWeight, product of:
                1.1615868 = boost
                4.5213976 = idf(docFreq=1306, maxDocs=44218)
                0.021587765 = queryNorm
              0.28258735 = fieldWeight in 5895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5213976 = idf(docFreq=1306, maxDocs=44218)
                0.0625 = fieldNorm(doc=5895)
          0.049257588 = weight(abstract_txt:anderen in 5895) [ClassicSimilarity], result of:
            0.049257588 = score(doc=5895,freq=1.0), product of:
              0.15102805 = queryWeight, product of:
                1.340647 = boost
                5.2183776 = idf(docFreq=650, maxDocs=44218)
                0.021587765 = queryNorm
              0.3261486 = fieldWeight in 5895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2183776 = idf(docFreq=650, maxDocs=44218)
                0.0625 = fieldNorm(doc=5895)
          0.052700445 = weight(abstract_txt:haben in 5895) [ClassicSimilarity], result of:
            0.052700445 = score(doc=5895,freq=1.0), product of:
              0.18084873 = queryWeight, product of:
                1.7967556 = boost
                4.6624994 = idf(docFreq=1134, maxDocs=44218)
                0.021587765 = queryNorm
              0.2914062 = fieldWeight in 5895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6624994 = idf(docFreq=1134, maxDocs=44218)
                0.0625 = fieldNorm(doc=5895)
        0.24 = coord(6/25)